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Abstract. 

We review the statistical mechanics approach to the study of the emerging collective 
behavior of systems of heterogeneous interacting agents. The general framework 
is presented through examples is such contexts as ecosystem dynamics and traffic 
modeling. We then focus on the analysis of the optimal properties of large random 
resource-allocation problems and on Minority Games and related models of speculative 
trading in financial markets, discussing a number of extensions including multi-asset 
models, Majority Games and models with asymmetric information. Finally, we 
summarize the main conclusions and outline the major open problems and limitations 
of the approach. 
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1. Introduction 

Collective phenomena in economics, social sciences and ecology are very attractive 
for statistical physicists, especially in view of the empirical abundance of non-trivial 
fluctuation patterns and statistical regularities - think of returns in financial markets or 
of allometric scaling in ecosystems - which pose intriguing theoretical challenges. On an 
abstract level, the problems at stake are indeed not too different from, say, understanding 
how spontaneous magnetization may arise in a magnetic system, since what one wants 
in both cases is to understand how the effects of interactions at the microscopic scale 
can build up to the macroscopic scale. Clearly, ecologies or financial markets are quite 
more complex systems than magnets, being composed of units which themselves follow 
complex (and far from understood) behavioral rules. Still, in many cases it may be 
reasonable to assume that the collective behavior of a crowd of individuals presents 
aspects of a purely statistical nature which might be appreciated already in highly 
styhzed models of such systems. This is ultimately the rationale for applying statistical 
mechanics to such problems. 

In general, statistical physics offers a set of concepts (e.g. order parameters and 
scaling laws) and tools (both analytical and numerical) allowing for a characterization in 
terms of phases and phase transitions which might be useful in shaping the way wc think 
about such complex systems. The considerable progress achieved in the last decades 
in the statistical mechanics of non-equilibrium processes and of disordered systems, 
thanks to which it is now possible to deal effectively with fluctuations and heterogeneity 
(respectively) in systems with many interacting degrees of freedom, is particularly 
important for socio-economic applications. In fact, while equilibrium and homogeneity 
are important in physics, non-equilibrium and heterogeneity are the rule in economics, as 
each individual is different both in his/her characteristics and in the way he/she interacts 
with the environment. Deriving general macroscopic laws taking the specific details of 
each and every individual's behavior into account is a desperate task. However, as long 
as one is interested in collective properties, a system with complicated heterogeneous 
interactions can be reasonably well represented as one with random couplings [1]. In 
the limit of infinite system size, some of the relevant macroscopic observables will be 
subject to laws of large numbers, i.e. some quantities will be self- averaging, and, if 
the microscopic dynamics follows sufficiently simple rules, one may hope to be able to 
calculate them explicitly. It is with these properties - which we call typical - that the 
statistical mechanics approach is concerned. 

In what follows, we shall mostly concentrate on problems arising in economics 
and finance. When modeling these systems one must be aware that their microscopic 
behavior is very different from that governing particles or atoms in physics. Economic 
agents typically respond to incentives and act in a selfish way. This is usually modeled 
assuming that individuals strive to maximize their private utility functions, with no 
regard for social welfare. Not only agents might have confiicting goals, as their utility 
functions will in general be different, but their selfish behavior may lead to globally 
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inefficient outcomes - e.g. to a coordination failure or to a lack of cooperation. Such 
outcomes, called Nash equilibria in Game Theory, are in general different from socially 
optimal states where the total utility is maximized. Hence, generally, in a system of 
interacting agents there is no global energy function to be minimized. 

Another important difference between the dynamics of a physical system, such as 
a magnetic material, and that of an economic system is that, while in the former spins 
at a particular time depend at most on the past states of the system, in the latter the 
agents' choices also depend on the expectations which they harbor about the future 
states. This suggests that the collective dynamics may have a non-causal component 
(indeed, backward induction in time plays a big role in the strategic reasoning of rational 
agents [2]). In many cases, however, it is reasonable to assume that agents are boundedly 
rational or 'inductive', i.e. that their behavior as well as their expectations adjust as a 
result of experience. We shall concentrate our analysis to these cases of adaptive agents 
following a learning dynamics. We shall see that the lack of a global Hamiltonian is 
reflected in the fact that such a dynamics, in general, violates detailed balance. 

Actually, in many cases it is realistic to assume that agents behave as if they were 
interacting with a system as a whole - be it a market or the crowd - rather than 
directly with a number of other individuals. In economics, this is termed a price-taking 
assumption, because it amounts to stating that agents act as if prices do not depend 
on what they actually decide to buy or sell (i.e. they take prices as given), and it is 
usually justified by saying that the contribution of a single agent to the total demand is 
negligible when the number of agents is large. The equilibria of systems where agents 
behave as price-takers are called competitive equilibria. However, prices depend on the 
aggregate demand and supply and hence on the choice of each agent, and the statistical 
physics approach provides a very transparent description of how price-taking behavior 
modifies the global properties of a system. 

This review gives a survey of some recent quantitative developments on the 
statistical mechanics of systems of many interacting adaptive agents. This is a subject 
that has been shaped over the past few years around a few basic models (like the El 
Farol problem) and a few analytical techniques, mostly borrowed from the mean-field 
theory of spin glasses (like the replica method). The models, though highly stylized 
to an economist's eyes, possess a strong physical content and in many cases provide 
important indications as to whether the phenomenology of real systems is specific of 
each of their particular natures or rather it is generic of large systems of adaptive units 
interacting competitively. Ultimately, it is not too unfair to say that separating system- 
specific features from general features can be seen as the main contribution statistical 
physics can provide to this field (besides techniques). 

Our choice of arguments is clearly biased, and the reader may dispose of several 
recent books that cover some of the important issues (especially finance-inspired) we 
merely touch here [3-7]. Along with a core of problems related to the emergence of non- 
trivial fluctuation phenomena, cooperation and efficiency (understanding which has been 
the original goal of these studies) , other issues such as the impact of different information 
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structures or the interaction between different multi-agent systems have just started to 
be analyzed and are likely to attract a great deal of attention in the near future. On 
the physical side, precisely because of the differences in the microscopic modeling of 
economics and physics, these systems pose a number of fascinating questions that open 
several directions for further work, some of which will be outlined here. 

The review is organized as follows. In Sec. 2 we present a general discussion 
of resource allocation by complex adaptive systems and a few exemplary models from 
different contexts hke ecology and traffic dynamics, including the El Parol problem. Sec. 
3 is devoted to the statistical analysis of optimal properties of large random economies, 
that is, more precisely, to a survey of the macroscopic properties of classical economic 
optimization problems. Most of our attention will be on the model of competitive 
equilibrium for linear production economies and on Von Neumann's model of economic 
growth. In Sec. 4 wc review the basic properties of the Minority Game, a minimal and 
yet highly non-trivial model of speculative trading derived from the El Parol problem, 
and discuss the role of the different parameters involved in its definition. Besides its 
physical richness, the Minority Game provides a simple adaptable framework where 
a number of important issues related to financial markets (such as the emergence 
of 'stylized facts', the role of different types of traders and the effect of information 
asymmetries) can be analyzed in great detail. Some of them are discussed in Sec. 
5. Pinally, some concluding remarks are expounded in Sec. 6. The main analytical 
techniques employed for these studies will be discussed in some detail only for cases 
where details are not available in the published literature: the replica technique for a 
model of a competitive ecosystem in Sec. 2; the continuous-time limit approach for the 
El Parol problem, also in Sec. 2; the dynamical generating functional for the canonical 
multi-asset Minority Game in Section 5. 

2. Statistical mechanics of resource allocation: some examples 

We start our discussion by introducing a general class of problems where a population 
of heterogeneous agents competes for the exploitation of a number of resources. Then 
we will discuss a few examples - ranging from ecosystems to urban traffic - where this 
generic framework can be formalized in specific models where the nature of resources 
and the laws governing the behavior of agents are completely specified. 

2.1. General considerations 

In a nutshell, the models we consider address the decentralized allocation of scarce 
resources by N heterogeneous selfish agents subject to public and/or private information. 
The word 'allocation' is to be intended here in a broad sense that includes the exchange 
of resources (for example, commodities) among agents, the production of resources by 
means of other resources and the consumption of resources. Agents take decisions on the 
basis of some type of information aiming at some pre-determined goals, like maximizing 
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a certain utility function, and are to various degrees adaptive entities. We shall consider 
cases in which they are perfect optimizers (or 'deductive') as well as cases in which 
their decision-making is governed by a learning process ('inductive'). Heterogeneity 
may reside in a number of factors, like the agents' initial endowments, their learning 
abilities or in how differently they react to the receipt of certain information patterns. 

In general, the allocation is a stochastic dynamical process, where the noise may 
be present in both the information sources and the agent's learning process. We shall 
mostly be concerned with the steady-state properties and, more than on individual 
performances, we shall focus on the resulting distribution of resource loads and in 
particular on 

a. how evenly are resources exploited on average (i.e. whether the allocation process 
leads typically to over- or under-exploitation of some resources) 

b. the fluctuations of resource loads (i.e. how large the deviations from the average 



In such contexts as production economies, ecosystems or traffic the meaning and the 
relevance of the above observables is immediately clear. In toy models of financial 
markets, where, as we shall see, the role of resources is played by information bits, 
the former quantity plays the role of a 'predictability' while the latter measures the 
'volatility'. 

It is implicitly assumed that optimal allocations are those where resources are 
exploited as evenly as possible and where fluctuations are minimal. In an economic 
setting, this corresponds to allocations with minimal waste whereas in financial markets, 
optimality implies information being correctly incorporated into prices with minimal 
volatility. 

In what follows, we shall denote by (• • •) time averages performed in the steady 
state: 



where Teq is an equilibration time. Moreover, we shall label agents by the index 
i e {1, . . . , N} and resources by the index n e {1, . . . , P}. In the statistical mechanics 
approach, the relevant limit is ultimately that where N ^ oo and P scales linearly with 
N, so that a = P/N remains finite as diverges. To give a loose name, we shall call the 
relative number of information patterns a, which will be our typical control parameter, 
the 'complexity' of the system. 

Denoting by Q^{t) the load of resource /i at time t, which is determined by the 
aggregate action of all agents (for instance, ^ may be a certain commodity and Q^{t) 
the demand for it at time t), one easily understands that the relevant macroscopic 
quantities are given respectively by 



can be) 




(1) 





(2) 
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{(Q) = {Q^)) which measures the deviation of the distribution of resource 

loads from uniformity (if 7^ at least one resource is overexploited or underexploited 
with respect to the average load) and by 



which measures the magnitude of fluctuations. Efficient steady states have H — and 
(T^ "small" in a sense that will be specified from case to case. To fix ideas, whenever 
fluctuations are smaller than those which would be obtained by zero-intelligence agents 
who act randomly and independently at every time step one can infer that agents are 
to some degree cooperating to reduce fluctuations. 

An important question we shall typically ask is how efficient are the steady-state 
resource loads distributions generated by a particular group of agents with a given 
information stream. Besides this, we shall also look at the inverse problem, namely 
under which conditions can a steady state satisfy criteria for efficiency. For example, 
what type of information should one inject into the system in order to facilitate the 
reach of a steady state in which H and cr^ are as small as possible? Indeed the structure 
of the information agents have access to may drastically affect global efficiency in many 
cases (e.g. traffic models). 

2.2. A simple model of ecological resource competition 

Ecosystems constitute a foremost example of the class of problems we outlined above [8]. 
The following can be seen as a minimal model of a competitive ecology with limited 
resources. Such a model will be taken as a prototype to illustrate the statistical 
mechanics (static) approach. The statistical mechanics approach to ecosystems has 
been pioneered in [9] based on the generating functional approach. The central issue is 
that of the May's biodiversity paradox [10], which shows that, contrary to expectations, 
increases in biodiversity in a random ecosystem enhance its instability. We shall indeed 
find the same result. 

2.2.1. Definition Let us consider a system with species whose populations ni{t) 
(i e (1, . . . , N}) are governed by Lotka-Volterra type of equations: 



Q^^ denotes the abundance of resource fi G {1, . . . ,P} (be it a mineral, a particular 
habitat, water. . . ) while g'f is a coefficient saying how much species i benefits from that 
resource. The constant fi is the population's decay rate 'in absence of resources'. To 
simplify things, we mimic the complex interdependence between species and resources 
by assuming that the grf s are independent, identically distributed quenched random 
variables. 




(3) 




p 



(4) 
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The abundance of resource ji depends on the population of each species, i.e. 

N 

Qm^Q'o-T.^^rij{t) (5) 

where Qq is the amount of resource /i that would be present in the system if no species 
fed on it. To fix ideas, let us suppose that 

= p + sVp (6) 

where s > is a constant and is a quenched Gaussian random variable with zero 
average and {x^x") = S^^. (The P-scaling is introduced in order to obtain a well-defined 
limit A^" — > oo, or P — > oo) Loosely speaking, the parameter s is related to the variability 
of resources: for small s, the resource level is roughly the same for all resources, while 
increasing s the distribution of resource levels gets less and less uniform. Clearly, the 
number of species that survive (i.e. such that nj(t) > 0) in the steady state will depend 
on a number of factors, like the distribution of available resources and how similar the 
species arc among themselves, that is on the distribution of 's, which we take to have 
first moments 

m = Q, «(?r - in = i (7) 

(here and in what follows we denote averages over the quenched disorder by ((■ ■ ■))) Along 
with the questions concerning the resulting resource loads distribution, an interesting 
problem to raise is the following: what is the typical maximum number of species that 
can be supported asymptotically when the number of resources P is large (P — > oo) as 
a function of s? 

This issue can be tackled by noting that 

1 ^= 

/i 1=1 

is a Lyapunov function of the dynamics (i.e. H{t) < 0; this can be easily shown by a 
direct calculation). This implies that the steady state properties are described by the 
minima of H over {rii > 0}. Note that in the steady state 

In the rest of this section we shall first work out in detail the minimization of H 
and then discuss the resulting scenario. 
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2.2.2. Statics (replica approach) The task of minimizing H can be carried out by 
introducing a 'partition sum' 

Z = IV„e-^^, n^im} (10) 

and applying the rephca trick: 

hm //min — \\ = — hm hm — — ((logZ)) = — hm hm hm — log ((Z'')) (11) 



The calculations are relatively straightforward. Using the Hubbard-Stratonovich trick 
we can write 
p 



Z^Trr 



where (■ ■ is an average over the Gaussian variables with (z^)^ — and {z^z'^)^ — 



5fj,^. So we have 



/ p N \ 



\lJ,=l i=l 



where the index a runs over replicas (a = 1, . . . , r). The first disorder average is done 

over Qq as given by (6) (thus more properly over x^), while the second is done over the 
s. The former is easily performed. As for the latter we note that if f3/N ^ 1 (which 
is the case since we first take the limit N ^ oo and then the limit —>■ oo), then 

We thus find 

\m=i /, 
The leading term in the above exponential is the first one. However, it corresponds to 
an undesirable super-extensive term in the free energy unless 

i 

If SO, the annoying term acts as a ^-distribution that ensures the above condition: 
Furthermore one sees that the relevant macroscopic order parameter is the overlap 

Gab^ Jj^riiaUib (14) 
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which can be introduced in the rephcated partition sum with the identities 



(15) 



for all a > b. Noting that when f3 ^ oo only the minima of H contribute to the partition 
sum, it is easy to understand that Gab measures how similar different minima a and b 
are to each other. We may now factorize over resources to obtain 



z 



so that, finally, factorizing over species, we arrive at 

((ZO) = y" e-^'''f^'"'^''^UwdGdR (16) 

with 

/(«;, G, R) = ^tr log [I + P {as' + G)]+^Y1 ^-^^-^ 

^ a>b 

rq^ rp \ / ^ 

where now {■ ■ ■) f stands for an average over the distribution of decay rates. By the 
principle of steepest descent, when N ^ oo, {{Z^)) is dominated by the saddle point 
values of the order parameters G, R and w (which we shall denote by a *) so 

hm min ^ = lim lim f{w\ G\ R*) (18) 

To proceed further, we assume that G*, R* and w* take the replica-symmetric (RS) 
formj 

Gl,^g+{G-g)5ab ^ 2r - {r + p/ P)5ab w^^w (19) 

which leads, in the limit r — > 0, to the free energy density 

Us{9, G, r, P,w)^^ log [1 + P{G - g)] + ^YTJi^^) ^ ^^^^ " 



^Gp--w-U log r dn e-/^^("l^'/)\ (20) 
2 Q P \ Jo / zj 



where the "potential" V is given by 

V{n\z, /) = ^apn' + {w + f/N - ^/arz) n (21) 

and the average (• • •)^ ^ is over both the unit Gaussian variable z and the decay rate 
/, whose distribution we left unspecified up to now. It is clear that if this distribution 

X This assumption gives the exact results in almost all the cases we shall discuss in this review because 
the functions to be minimized have a unique minimum. Should this condition fail, one must resort to 
more complicated Ansatze known as replica-symmetry breaking. 
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has finite moments and does not get broader with N, we can drop the term f /N above. 
Now let us take the remaining limit j3 ^ oo, where minima are selected, assuming that 
H has a unique minimum. In this case, clearly, G ^ g (there is only one minimum by 
assumption!) and we may look for solutions with 



hm /3{G -g)^x 

finite. Moreover, the last integral in (20) in the limit j3 
minimum of V . Therefore we end up with 

, , aas'^ + G 
lim t-oaiq.G.r, p,w) — 



(22) 

oo is dominated by the 



ar a 1 / 2\ / 



n) 



ar {zn} 



(23) 



where (• • •)^ are averages over the normal variable z, with the n — n*{z) which minimizes 
V: 



n*{z) 



ap 



z - zq)9{z - Zo) 



wj \fQX. 



Notice that this operation corresponds to an 'effective species' problem whose solution 
describes the collective behavior of the original A^-species system. 
The saddle point equations are 



dw 


= 




a 

<»>. = - 


dfns 
dp 


= 




= G 


<9/rs 
dr 


= 




(nz)^ = ^/arx 


dfRs 


= 




1 


dG 




^~ 1 + X 


dfRs 


- 




as'^ + G 


dx 







It is easier to find a parametric solution in terms of Zq: let us define 



dz 



-V2(^_^„)^£_^_|erfc(Vx/2) 



h{zo) = r -^e-^'l\z - zof = ^ (1 + z'o) erfc (zo/V2) - 

Iz{zo) = / -^e"^'/^2;(2; - 2^0) = ^erfc (zo/V2) 
Jzo "^/27r Z \ , 



-412 



27r 



After some manipulations we find 



1 

"=2 



(24) 



(25) 
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Figure 1. Behavior of ac as a function of sq. 



G 



X 



(26) 



The assumed scaling of parameters with /3, and hence the above equations, are vahd 
only for zq < Zq where Zq is the solution of 

Uzl) [h{zl) - h{zl)] = s\'ll{zl). (27) 

Indeed X ^ cxo as ^ -Zg. This singularity marks a phase transition at a point 
o^c = Iz{zq) between a phase a > Uc which is described by the equations above, and one 
where x = cxd. The critical point etc is a decreasing function of sq (from a value etc = 1/2 
for sq = 0) which rapidly vanishes as sq increases (it's already 10~^ for sq — A). It is 
reported in Fig. 1. At the transition, the susceptibility x ~ |q; — ad"^ diverges and 
the free energy, which as we said is proportional to the variance of the resource loads 
distribution, vanishes. This means that below ac all resources are exploited to the same 
extent, while above ac the resource load distribution is not uniform. For the fraction of 
surviving species (with rij > 0) we get 

dz 



<P = {9{z - zo)), 



r^e-^/^ = lerfcfw^ 



'2t: 

so (f) < a ior a > ac and — > a at ac- This means that below ac the number of 
surviving species equals that of resources while for a> ac there is on average less than 
one species per resource (or (j)/a < 1). The behavior of if, G and of the fraction of 
surviving species per resource 4>/a is displayed in Fig. 2 as a function of a for fixed sq. 



(28) 



2.2.3. Stability Note that at fixed q the maximal number P/«c of species that can be 
sustained in an ecosystem with P resources is an increasing function of s, so that by 
increasing the variability of resources the ecosystem gets more stable. The threshold of 
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Figure 2. Behavior of H, G and of the fraction of surviving species (j) as a. function 
of a for sq = 1. 



stability also increases if q increases. Having fixed the variance of to 1, increasing q 
means that species get more and more similar. This seems at first sight a contradictory 
scenario. To sort out this issue, let us analyze the linear stability of the system. Let 
ni{t) = Tii(oo) + a/ ni{oo)rii{t) where rij(oo) is the asymptotic value of the population of 
species i and r)i{t) is a small perturbation. To leading order, the dynamics is given by 

N 

m{t) = -J2A,,r],{t) (29) 
j=i 

with 

^ p 

The stability is related to the smallest eigenvalue A_ of A^j. This can be computed 
explicitly [11] and it is given by: 




This shows that the phase transition point, where etc = (p{cxc), is the onset of dynamical 
instability of the system. The presence of the factor l/g in A_ causes an interplay of the 
effects of increasing s and increasing q (since A_— >OasQ'— >oo), ultimately leading to 
a maximal stability for intermediate values of q, as can be seen by the behavior of A_ 
versus q, Fig. 3. It is also easy to show that A_ is an increasing function of a, for all 
values of s and q. Hence the introduction of new species always decreases the stability 
of the ecosystem, in agreement with May's classical result [10]. 
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Figure 3. Behavior of A_ as a function of q for a ~ 0.2 and s = 1. The ecosystem is 
marginally unstable when A_ = 0, whereas maximal stability occurs when A_ attains 
a maximum. 

2.3. The El Farol problem 

The El Farol problem is the paradigm of resource allocation games with inductive agents. 
It can be stated as follows [12]. N customers labeled i have to decide independently on 
each night t whether to attend (ai(t) = 1) or not (ai(t) = 0) the El Farol bar, which has 
a capacity oi L < N seats. The place is enjoyable only if it's not overcrowded, that is 
only if the attendance A{t) — doesn't exceed the number of seats. In order to 

make their decisions, customers aim at predicting whether the bar will be crowded or 
not on any given night based on the past attendances. 

In his seminal work, Arthur has pointed out the frustration inherent in such a 
situation. If everybody expects that the bar will be crowded, no one will go and the bar 
will be empty. Conversely all agents may attend the bar at the same time, if they all 
expect it to be empty. Hence he argued that this is a situation which forces expectations 
of different agents to diverge. It is reasonable to think that, if agents start with different 
expectation models and revise them according to the history of the attendance, their 
expectations will never converge and agents' heterogeneity will be preserved forever. 
He then showed by computer experiments with N — 100 and L — 60 that inductive 
agents endowed with fixed 'predictors' (namely look-up tables associating to each series 
of past attendances a binary decision like go/don't go) are able to self-organize so that 
the attendance A{t) fluctuates around the comfort level L. 

Note that the El Farol problem can be regarded as an embryonic market where 
L units of an asset or a commodity must be allocated on any given day t. They are 
offered to agents who may decide to invest 1€ to buy it (aj(t) = 1) or not {ai{t) = 0). 
The attendance A{t) is then the demand of the asset (the number of available units, 
or supply, is fixed at L). Each unit of asset delivers a return of 1€ to its owner at the 
end of the period. Imagine that the price at which the asset is sold is determined at 
each period by a market clearing condition (demand = supply): A{t) — Lp{t). Then an 
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Figure 4. De Bruijn graph of order 3 (from [15]). 

agent who invests ai{t)'€. in the asset, receives ai{t)/p{t) units of it. These will be worth 
ttiit) / p{t)'€. at the end of the period. If p{t) > 1, which occurs if A(t) > L (crowded 
bar), it is not convenient to invest (attend). lip{t) < 1 it is instead worthwhile to invest 
(attend). 

2.3.1. Definition In what follows, we focus on a tractable version of the model that 
differs from Arthur's original work in the form of the predictors but preserves all the 
main qualitative features of the model [13]. In order to formalize the problem it is 

reasonable to assume that (i) customers have a finite memory, that is, their analyzing 
power is limited and they must base their prediction on the attendances of a finite 
number (say, m) of past nights, and that (ii) they are insensitive to the actual size of 
the attendance (perhaps simply because they don't have access to it) and rather only 
know whether the bar was overcrowded or not on a given night. This means that the 
information available to customers on night t is encoded in the string 

^(t) = {e{L-A{t-l)),...,9{L- A{t -m))}e {0, 1}"^ (32) 

where 6{-) is the Heaviside function: 6{L — Aif)) = 1 if the bar is enjoyable (A(t) < V) 
while Q{L — A(t)) = if the bar is overcrowded (A(t) > V). The time evolution of the 
string /i(t) is governed in time by the map 

[xilu + 1) = [2^(t) + B{L - A{t))\ mod(2™) (33) 

The above equation completely defines the structure of the information available to 
agents in the case in which they base themselves on the past attendances. Graphically, 
the evolution of history strings is constrained to occur on a de Bruijn graph [14] of order 
m. Fig. 4. 

We shall consider, for comparison, another possibility, namely that the information 
supplied to customers is a random binary string of length m or equivalently a random 
integer ('information pattern') drawn from {1, . . . , 2"* = P} with equal probability at 
each time step. We shall refer to the latter as the case of exogenous random information, 
as opposed to the former of endogenous information. The obvious difference between 
the two choices is that while in the latter case the space of informations is sampled 
uniformly by construction, in the former this is in principle not true. There is however a 
deeper difference that has serious consequences on the analytical solubility of the model: 
in the case of random information the dynamics is Markovian. 
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Having defined the information source, let us specify the agents' behavior. Even 
in a simphfied context, making the optimal decision for each given string requires 
an unrealistic computational capacity that should be shared by all agents. Inductive 
reasoning requires that customers stick instead to simple decision rules. In particular, 
we assume that they have at their disposal a small number S of 2"*-dimensional vectors 
called 'strategies' (analog to Arthur's predictors) that map information strings into 
binary actions (go/don't go): 

a,, : {0, 1}'" 3 ;x ^ e {0, 1} = 1, . . . , iV; = 1, . . . , 5) (34) 

In the table below one such possible strategy is shown for m = 3 (or P = 8). 



past attendance string 


pattern 


decision 


000 


1 


1 


001 


2 





010 


3 





Oil 


4 


1 


100 


5 


1 


101 


6 





110 


7 


1 


111 


8 






Customers are heterogeneous as of course different agents have different strategies. 
This is modeled by assuming that each component af^ of every strategy aig is drawn 
independently for all i, and g with probability distribution 

P{a) = a5{a - 1) + (1 - a) 5{a) (35) 

where a is the average attendance frequency of agents. Strategies are assigned to agents 
at time t — Q and are kept fixed throughout the game. In order to decide which strategy 
to adopt on every night, agents keep tracks of their performance via a score function 
that is updated according to the following rule: 

Uig{t + 1) - Uig{t) = (l - 2aff ) [A{t) - L] (36) 

with the rationale that strategies suggesting not to go (a^^*^ = 0) are rewarded when 
the attendance is higher than L and punished when it is lower than L (and vice versa 
when afg*^ = 1)- Then on each night every agent selects the strategy with the highest 
cumulated score: 

gi{t) = arg maxUig{t) (37) 

9 

and acts accordingly: ai{t) — of^^J^y In short, the model's rules can be summarized as 
follows: 

gi{t) = arg maxC/jg(i) 

9 
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Figure 5. Average deviation of the attendance from tlie comfort level (top) and 
fluctuations (bottom) versus aN / L for endogenous (solid lines) and exogenous (dashed 
lines) information (from [13]). 

^W = E€i) (38) 

i 

U,git + 1) - f/.,(t) = (l - 2arf ) [Ait) - L] 

(from top to bottom: strategy selection; aggregation; updating). It is understood that 
scores are initialized at time t = at certain values Uig{0). 

2.3.2. Macroscopic properties After a transient, the dynamics defined by (38) will 
reach a steady state whose global efficiency can be conveniently characterized by two 
parameters: the average deviation of the attendance from the comfort level L, {A — L) 
and its fluctuations = {{A — L)"^). The former measures the degree to which agents 
coordinate to generate attendances around the comfort level. The latter measures the 
waste of resources: the larger cr^ the bigger the deviations of the attendance from 
the comfort level (in either direction). In a nutshell, it quantifles the quality of the 
coordination. The behavior of the two quantities (properly normalized with A^) at flxed 
L = 60,a = l/2 and m = 2, 3, 6 and varying is displayed in Fig. 5 for endogenous 
(solid lines) and random (dashed lines) information. In the former case, a general feature 
that emerges is that the average attendance settles at the comfort level in a window of 
values of a centered around L/N whose size shrinks as m increases. Out of this window, 
sensible deviations may occur. In parallel, fluctuations are maximal at a = L/N for 
m = 2 and the height of the maximum decreases with increasing m until it disappears. 
This implies that the waste of resources is comparatively larger when m is smaller, so 
that for instance the fraction of losers is larger for small m. Thus one can say that global 
efficiency increases when m increases. The behavior in the case of random information 
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Figure 6. Normalized entropy Sm/m versus aN/L for different memory lengths m 
(from [13]). 



is qualitatively similar to the previous case in the vicinity of aN/L — 1. Quantitative 
deviations occur outside this region. 

Based on this, one expects that with endogenous information the information space 
is sampled uniformly around a ~ L/N . This is indeed so. To see it, one can measure 
the frequency with which histories are sampled in the steady state, p{ii)i and calculate 
the entropy 



such that S{m) = m when the information space is sampled uniformly (the 'effective' 
number of information patterns visited by the dynamics is 2'^^"*^). As shown in Fig. 
6, S{m)/m ~ 1 only for when a ~ L/N. Outside this phase, the entropy decreases, 
signaling that the information dynamics is biased. 

These findings indicate that the degree to which inductive agents are able to 
coordinate the exploitation of the limited resource in a way that is collectively efficient 
depends on the size of the information space they base themselves on. While the average 
level of activity always settles at the resource level, fluctuations get smaller and smaller 
as the information space grows. When the average attendance frequency is close to 
L/N, then, the particular nature of the information provided to agents doesn't affect the 
stationary macroscopic properties. The relevant requirement is that all agents possess 
the same information, independently of whether it's the true attendance history or a 
random string. 

2.3.3. Dynamics (continuous-time limit approach) The mathematical analysis of this 
model can be carried out in the case of exogenous information by studying the 
continuous-time limit of (38) along the lines of [16]. A few simplifications are necessary 
to this aim. First note that the dynamics (38) is non-linear in a way that doesn't allow 




(39) 
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to write it in the form of a gradient descent, that is physically the model is defined by 
a set of globally coupled Markov processes that violate detailed balance and it is not 
clear that a Lyapunov function exists. It is however possible to regularize the dynamics 
by smoothing the choice rule Qiit) = arg max^ Uig(t) to 

PToh{gi{t) ^g}^ C{t) e^^">^^^ C{t) = normahzation (40) 

with r > the 'learning rate' of agents (the original choice is recovered for F — > oo). 
This modification is not without consequences and F indeed turns out to play a rather 
non-trivial role in the macroscopic properties. With (40), it is possible to construct the 
continuous-time limit of (38). 

The crucial observation is that there is a 'natural' characteristic time scale for 
the dynamics given by P (intuitively, agents have to check the efficiency of their 
strategies against all information patterns before they can evaluate their performance 
meaningfully). This implies that if one is interested in steady state properties, time 
should be re-scaled as t — > r = t/P. Iterating (38) from time t — Pr to time 
t — P{t + dr) and setting Uig{T) — Uig{PT) one obtains 

P(r+dr) 

u,g{T + dr) - u,g{T) ^ - J2 (l-2aff)Kt)-L] (41) 

t = PT 

The arguments of the sum on the right-hand side can be separated into a deterministic 
and a fluctuating term: 

(l - 2af ) [A{t) -L] = (1 - 2a,,) {[A{t) - L])^ + X,g{t) (42) 

where we used the fact that information is exogenous and random and we denoted by 
(• • •)^ an average over the distributions 

P(T+dr) 

= ^ E C{t)e'^^^^'^ (43) 



Pdr 

t=PT 



We have therefore 



Uig{T + dr) - u,g{T) = (1 - 2aig) {[A{t) - L])^ dT + dWig{T) (44) 

where dWig{T) = {i/P)^^l'f^^'^^ Xig{t) is a noise term whose statistics (average and 
correlations) can be derived by noting that Xig{t) are independent identically-distributed 
zero-average random variables, so {dWig{T)) = and 

/ P(T+dT) P(T'+dT) \ 

{dW,g{T)dW,giT')) = /— E E X,g{t)X,git')) 

= ^-^^y^{XMX,g,{t))^dT (45) 

The remaining term, {Xig{t)Xjgi{t))^ can be evaluated from the statistics of disorder 
and of A{t). Finally, taking the limit dr ^ one arrives at the following Langevin 
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process: 

u,,{t) = (1 - 2a,,) {[A{t) - L\)^ + 77,,(t) 

(%(t)) = (46) 

Mr)ri,Ar')) ^ ^^^~/^'^- (2a..-l)(2a,,,-l)^(r - r') 

where in the last relation we have factorized the average over //'s. Note also that the 
averages on the right-hand side of Eq. (46) are taken at fixed Hig{T) = C{t) exp[rMjg(T)] 
so they are themselves time-dependent. Therefore (46) is a set of complex, non-linear 
stochastic differential equations in which the noise correlation is also time-dependent. 
At the same time, the probability to choose a predictor g, TTiglr), is easily seen to satisfy, 
in the re-scaled time Fr = r the stochastic equation 

T^igir) = 7:ig{T)F[7r] + VfG[7r, rj] (47) 

where F and G are F- independent functions whose form is not relevant for our scopes. 
This tells us that agents' preferences are subject to stochastic fluctuations of strength 
proportional to VT around their average. The larger F the longer it takes to average 
fluctuations out. Moreover only in the limit F 0, in which the dynamics if the Trig's 
(and consequently of the Mjg's) becomes deterministic, the system performs a gradient 
descent with the Lyapunov function 

^ = ^ E ((^1-") - ' (^1-") = E f^9< (48) 

where (■ • ■ denotes a time-average in the steady state conditioned on the occurrence 
of pattern /i: 

^ T T 



t=T, 



eq 



and fig = {'Kig). 

Thus the minima of H over fig (subject to fig — 1 for all i) describe the steady 
state. From a physical viewpoint, H measures the amount of exploitable information 
produced in the system, or the 'predictability': if e.g. {A\v) ^ L, the signal fi{t) carries 
information which is useful to predict whether one should attend or not to the bar 
when n{t) = u. The fact that the stationary state corresponds to minimal H means 
that agents exploit to their best the system's predictability. We shall term phases 
with H — 'unpredictable' or 'symmetric', while phases with H > will be called 
'predictable' or 'asymmetric'. 

Notice also that the noise correlations are proportional to the volatility cr^ = 
{{A — Vf) which in turn depends on the set of all ttig's. Hence calculating the volatility 
requires solving a much more complex self-consistent problem. 

The minimization can be carried out analytically resorting again to the replica trick. 
The thermodynamic limit to be considered in this case is iV — > cxo with I = LjN and 
a — P/N finite. The interesting case is that where the average attendance frequency a 
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Figure 7. Phase diagrain of the El Farol bar problem. The solid line encloses the 
'unpredictable' phase where H = 0. The dashed lines correspond to the trajectories 
of systems with L = 60, a = 1/2 and m = 2, . . . , 6 as the number of agents increases 
(from bottom to top). The dot-dashed line corresponds to a typical trajectory of a 
system with fixed L, N and a > L/N as the agents' memory changes (from [13]). 



fluctuates around £ so that a — £ — 0{1/VN). Indeed, iia — £ — 0{1) then each agent 
wiU always use the strategy that prescribes him to go more (resp. less) often ii a < £ 
(resp. a > £). A convenient parametrization is given by 



a~i = ^f-^^ (50) 

with 7 finite and independent of N. The resulting phase diagram in the (0,7) plane is 
reported in Fig. 7. We see a region for small a and small 7 where H = 0, i.e. (A) = L. 
In this 'unpredictable' phase the average attendance converges to the comfort level but 
fluctuations are large. On the other hand, the typical attendance differs from L outside 
this phase. Looking at the m-dependence, we see that as varies with L and a and m 
flxed, the system follows the trajectories shown in dashed lines. For small values of m 
these cross the symmetric phase in the region aN ~ L. 

This rich phenomenology, and speciflcally the non-trivial interplay between 
predictability and fluctuations, is characteristic of the complexity of many other 
resource-allocation models, two of which we shall now discuss. 

2.4- Buyers and sellers in the 'fish market' 

Market organization, namely the establishment of stable relationships between buyers 
and sellers, is one of the basic mechanisms that determine the efficiency of commodity 
markets. An important question concerns the effects that organization has on prices 
and their fluctuations. This issue has been investigated in detail in [17] in the context 
of an empirical study of the Marseille flsh market. This is the sense in which this 
section refers to a model of a 'fish market'. Loosely speaking, one can think that a 
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seller with loyal buyers has an incentive to take advantage of the situation by raising 
prices, thus removing the incentive of buyers to be loyal to him. Once the relationship 
is broken, buyers will seek cheaper sellers thus driving a reduction of the average price. 
This mechanism however is expected to cause an increase of fluctuations (and thus a 
decrease of cost certainty), since in the 'disorganized' phase buyers will be switching from 
one seller to another. This elementary scenario, from which it is clear that efficiency is 
a two-sided concept, is worth of a deeper investigation. A highly stylized yet non-trivial 
model addressing this issue was introduced in [18]. 

One considers a system with N buyers and P sellers, which for simplicity may be 
assumed to sell different commodities each (say each seller supplies a different type of 
fish). Ultimately, the limit N ^ oo with n — N/P finite shall be considered. On 
each day t = 1,2,..., every consumer i has to acquire one of S possible bundles of 
commodities, for instance for his or her subsistence. A bundle is a vector q^g = {q^g} such 
that q^g denotes the amount of goods buyer i demands from seller (/x G {1, . . . , P}). 
g E {1, . . . , 5'} labels the different feasible bundles. We are interested to model the case 
in which buyers are heterogeneous, in the sense that different buyers have different needs 
and thus different possible bundles. We therefore assume that bundles q^g are quenched 
random vectors with probability distribution 

= n - - ^'9) + ^^^^tA , (51) 

(0 < 0' < 1 being the probability that any given commodity is part of a bundle) that 
are assigned to consumers independently on i and g on day t = and are kept fixed. 
In this way, we introduce a further simplification in that each seller is either visited or 
not by a buyer, and the purchased quantities play no role. Moreover, we are implicitly 
assuming that the different commodities are equivalent to consumers, that is there is 
no commodity that all buyers will need to buy. Coming to sellers, we assume that they 
set the daily price of their commodity according to the demand they receive, denoted 
by Di^{t), so that the higher the demand the higher the price. Each buyer, on the other 
hand, aims at purchasing, on each day, the bimdle he or she finds more convenient, 
labeled by gi{t), with the limitation that when the choice is made the price at which 
the purchase will take place is not known yet (it is determined by the collective decision 
of all consumers, which form the demands). Hence they try to learn the convenience of 
different bundles from experience in order to be able to predict which bundle will have 
the highest marginal utility on any given day. The events taking place on each day t 
can be summarized by the following scheme: 

gi{t) = arg meixUig{t) 
9 

i 

u,,{t + 1) - u,,{t) ^'^(^)] 

At the decision stage, each buyer chooses the bundle which carries the highest 
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(cumulated) utility Uig(t). The different choices are then aggregated and the demands 
are formed. Finally utilities are updated with the following rationale: if the demand of 
a commodity fi is above a certain threshold k, consumers perceive that commodity as 
too costly and the utility of all of his feasible bundles that include it will tend to be 
reduced. Similarly, if the demand has been lower than k the commodity will be seen as 
'cheap' and will tend to increase the utility of the bundles that contain it. The utility 
of a bundle is determined by the demands of all commodities in it. Finally, we assume 
that the score updating is initialized at values Uig{0). 

It is clear that k plays in this model the role of the comfort level L of the El Farol 
problem. Based on the discussion of the previous section, we concentrate on the case 
k — N{1 — q). The relevant macroscopic observables are given by 



H measures of how evenly buyers are distributed over sellers. Indeed ii H — 0, each 
seller receives on average the same demand so that none of them is perceived as more 
convenient by buyers. In this case, consumers are distributed uniformly over producers. 
If H > 0, instead, the distribution of demands is not uniform and some producers are 
seen as more or less convenient than others. A represents instead the magnitude of 
demand fluctuations. Note that because of our assumptions on the relation between 
prices and demands, if is a proxy for the average price whereas A quantifies the typical 
spread of prices in the economy. Note also that when if > an external agent who 
watches the economy from the outside trying to identify the best bargain would manage 
to find more convenient sellers and make a profit. When H — 0, instead, this would 
not be possible. So transitions from regimes with if > to regimes with H = can 
be seen as transitions between inefficient and efficient states of the economy, where by 
efficient state we mean one where goods flow from sellers to buyers in such a way that 
no information exploitable by an external agent is generated. States that arc optimal 
from a collective perspective have both H = and A small, because on one hand a 
uniform demand distribution is desirable and on the other price fluctuations should be 
such that agents have as much cost certainty as possible on a day by day basis. Hence 
H and A describe intertwined properties, and it is on their mutual dependence that we 
shall focus in what follows. 

Results are shown in Fig. 8. The behavior of H indicates that as the number of 
buyers increases they tend to distribute more and more uniformly over sellers until, 
for n = ric, H vanishes and the distribution becomes uniform. For n < ric the 
economy is inefficient as the uneven distribution of demands generates exploitable profit 
opportunities. For n > ric the economy is instead efficient. Notice that results are 
indeed independent of initial conditions Uig{0) in the inefficient phase, while for n < ric 
ergodicity is broken and the steady state depends on initial conditions. Furthermore, 




(53) 



(54) 




Figure 8. Behavior of H and E = A + iJ as a function of n for various q and flat 
{Uii{Q) = ?7i2(0)) and biased ([[/ii(0) — C/i2(0)] = 0.2) initial conditions (analytical 
curves and numerical simulations, from [18]). 



we see that in the inefficient phase fluctuations are small whereas when the economy 
becomes efficient the dependence on initial conditions may drive the system to both 
states with large price fluctuations where (A ~ n), which are rather undesirable, and 
states with small fluctuations (where A ~ V"^)- This can be interpreted with the 
following mechanism. When there are few buyers, many sellers receive small demands 
and thus the economy presents many profitable opportunities. As more and more 
buyers join the opportunity window shrinks and players may be forced to switch bundles 
repeatedly in the attempt to identify convenient commodities. This leads to the increase 
of fluctuations and ultimately to a loss of day-by-day cost certainty. 

Like the models described before, this one can also be studied analytically by 
resorting to a replica minimization. It is not difficult to see that the Lyapunov function in 
this case is precisely if, so buyers collectively act so as to exploit profitable opportunities 
as much as possibles. In [18] a different solution method, based on dynamical generating 
functional, is employed. We defer a discussion of this technique to Sec. 5.3. 

2.5. Route choice behavior and urban traffic 

A most striking example of the influence of different information structures on the 
stationary properties of these systems has been given in the experimental literature on 
behavioral aspects of route-choice dynamics in vehicular traffic [19,20]. Experiments 
dealt with groups of people having to choose at each time step (day) between two 
alternatives (routes), having at their disposals a certain externally provided information 
about the aggregate daily result, a sort of tunable traffic bulletin. The payoff for each 
choice depends on the number of agents making that choice in such a way that the larger 
this number the smaller the payoff. Experiments have shown that while agents were able 
to adapt rather well and reach states that were efficient on average, the overreaction. 
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Figure 9. Regular grid with two routes for traveling from A to B. 



namely the fluctuations or the difference between the optimal rate of decision change 
by agents and the actual rate of change, displayed a strong dependence on the type of 
information supplied, for instance with or without impact correction, time-dependent, 
user-dependent etc. In particular, the overall best states (smallest overreaction) were 
attained when the information is user-specific (see however [19,20] for additional details 
and more results). 

The issue of how the information structure affects macroscopic properties has 
been tackled in a traffic-inspired resource allocation game which can be seen, roughly 
speaking, as a lattice version of the previous 'fish-market' model [21]. Let us consider 
the following situation. A road network, which for simplicity is taken to be a square 
lattice with sites, is given. On each day, each one of N drivers has to travel from 
location A to location B (say, work/home) following one of S possible routes. The points 
A and B are different for different drivers while the routes at their disposal are taken to 
be S quenched random self-avoiding walks of length ^ going from A to B (see Fig. 9). 
Routes play here the role of the predictors of the El Parol problem and of the feasible 
bundles of the fish market: indexing lattice edges by each route g of every driver i 
can be written as a vector q^g — {q'^g}, where q'^g = 1 if driver i passes through edge ji in 
route and q^^g = otherwise. Drivers are assumed to be inductive and their behavior 
is governed by the following rules: 

Prob{^,(t) =g}^ C{t) exp [VUig{t)\ 

i 

u,,(t + 1) - u,,(t) = E €Q'(*) + ^ (1 - ^9Mt)) CM 

Let us discuss them in some detail. The first one says that agents choose their preferred 
route on day t, gi{t), using a probabilistic model with learning rate F > 0. Q^{t) denotes 
the traffic load on street ii on day t. The score updating process is composed of two 
parts: 

• the first term, — p Yli^i^igQ^{^)-i ^^y^ ^^^^ agents prefer less crowded routes; 
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• the second term, | (l — ^g,gi{t)) dgi't) is non-zero only for routes the driver has not 
taken on any given day and represents the information noise, or the inaccuracy 
with which he knows the traffic load on network edges he hasn't visited. Cigit) is a 
Gaussian noise with mean 7] and correlations 

{Cig{t)Cjh{t')) = ASij6,kSu' (56) 
Different information structures correspond to different values of 77 and A: 

• the case 77 = A = (no information noise) corresponds to the case in which all 
drivers possess complete knowledge of the traffic load on each network edge on 
every day 

• for A > the information about unvisited edges is user-specific and noisy. In 
particular 

— for 77 = the noise is unbiased 

— for ?7 > the driver overestimates the performance of routes not taken 

— for ?7 < the driver underestimates the performance of routes not taken 

Let us notice, en passant, that smart drivers should be aware of the fact that the traffic 
load on a given route would have been larger had they chosen it and therefore they 
should underestimate the efficiency of unused routes. In other words, drivers account 
for their impact on the traffic loads when they are able to disentangle their contribution 
to it (their 'impact') before updating their scores. In this model, drivers completely 
account for their contribution to the traffic for rj — —2. We shall see however that 
any small 77 < is sufficient to alter significantly the collective properties. We shall 
distinguish between two types of drivers: 'random drivers' with F = 0, who choose their 
route every day at random with equal probability, and 'optimizers' with F = 00, who 
every day choose the route they expect to be faster. 

As usual, one is interested in the collective properties in the steady state. We have 
several control parameters, namely r], A, F and the vehicle density c = N/P. The 
observables we focus on are 

^"-^T.({Q'-^y) (58) 

where as usual (• • •) stands for a time average over the stationary state of the learning 
dynamics. Just as in the fish-market model, H describes the distribution of drivers 
over the street network in the stationary state, li H — 0, the distribution is uniform 
{{Q^) — (Q) for all /^) aiid it is not possible to find less crowded streets on the grid. If 
H > 0, instead, the distribution is not uniform and fast pathways do exist. Notice that 
if transit times are assumed to be proportional to the street loads Q^, then cr^ measures 
the total traveling time of drivers. Then, the optimal road usage is achieved when o"^ is 
minimal. Note that since all routes have the same total length i (i.e. Qig = ^ for all 
i and g), (Q) = c£ is a constant. 
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Figure 10. Left panel: cr^/TV (top) and H/N (bottom) for random drivers (open 
symbols) and optimizers (closed symbols). Simulation parameters: S = 2, P = 200, 
£ = 50, Uig{0) = for all i and g. Averages are taken over at least 50 disorder samples 
for each point. The solid line in the graphs is the analytic estimate of H and (for 
r = 0+) for the model with uncorrelatcd disorder. Right panel: behavior of cr^/iV for 
a city of P = 200 streets with N = 1024 drivers (c = 5.12, in the inefficient phase) as 
a function of the parameter A with r] = 0. The horizontal lines correspond to drivers 
with r = (dashed) and F = 0+ (dotted). Results for optimizers with F = oo are 
shown for equilibration times icq = 100 (full line), 400 (open circles) and 1600 (full 
diamonds). In the inset, data are plotted versus Atli" (from [21]). 



Numerical simulations for = A = reveal the picture displayed in Fig. 10). 
We see that random drivers lead to a stationary state where a uniform distribution 
of vehicles is never achieved, as if > for all c. Optimizers, instead, behave in a 
similar way only for small vehicle densities. As c is increased, the traffic load becomes 
more and more uniform {H decreases) and fluctuations (cr^) decrease, indicating that 
inductive drivers manage to behave better than random ones. At a critical point Cc — 3 
the distribution becomes uniform (i.e. H — 0) and vehicles fill the available streets 
uniformly. Now drivers can't find a convenient way and are forced to change route very 
frequently. As a consequence, global fiuctuations increase dramatically. Notice that 
above the critical point traffic fiuctuations are significantly smaller for random drivers 
than for optimizers. Finally, the stationary state depends on the initial conditions Uig{0) 
for c > Cc'. the larger the initial spread, the smaller the value of cr^. The conclusion is 
that random drivers lead to an overall more efficient state in conditions of heavy traffic 
while optimizers perform better when the car density is low. 

Unfortunately, the analytical side of this model is much harder than the previous 
examples because the quenched disorder (the feasible routes) is in this case spatially 
correlated. It is possible however to solve analytically a milder version with uncorrelated 
disorder. Results (shown in the Fig. 10) reproduce the qualitative behavior described 
above fairly well, and predict a critical density of Cc = 2.97 .. ..). 

For 77 = and A > 0, the dependence on initial conditions disappears and is 
replaced by a non-trivial dynamical behavior (see again Fig. 10). In the high density 
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Figure 11. Behavior of /N for a city of P = 200 streets with rj = —2. Adaptive 
(resp. random) drivers have F = oo (resp. F = 0) (from [21]). 



phase, where drivers would behave worse than random with A = 0, global efficiency 
can improve beyond the random threshold if A > 0. Taking averages after a fixed 
equilibration time teq, we find that reaches, for A ^ 40, a minimum that is well below 
the value of for F = O"*" with the same homogeneous initial conditions yi{0) = 0. For 
A — > oo we recover the behavior of random drivers. However, when we increase ^eq; 
the curve shifts to the left, showing that the system is not in a steady state. Rescahng 
A by t'^^'^, the decreasing part of the plot collapses, while the the rest of the curve 
flattens. This suggests that the equilibrium value of cr^ drops suddenly as soon as 
A > 0. Loosely speaking: noise-corrupted user-specific information can avoid crowd 
effects when the vehicle density is very high. 

We finally come to the case rj ^ and A = 0, Fig. 11. While one observes 
no qualitative changes for > 0, for 77 < fiuctuations arc drastically reduced in 
the supercritical phase. In particular, for rj = —2 (when, as we said above, drivers 
completely account for their contribution to the traffic) the dynamics converges to a 
state characterized by no traffic fluctuations (cr^ = H) because each driver selects one 
route and sticks to it. 

Hence this setup allows to address the impact of different information structures, 
and thus of different types of information broadcasting, on the collective properties of 
urban traffic. This is perhaps one of the most promising research lines with respect to 
applications opened by resource allocation games so far. 

3. Optimal properties of large random economies 
3.1. Introduction 



The standard tenet of microeconomics is that economic activity is aimed at the efficient 
allocation of scarce resources [22]. As we said before, 'allocation' includes exchange. 
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production and consumption of commodities. The concept of 'efficiency' is instead 
usually connected to the sohitions of constrained maximum and/or minimum problems, 
as for instance firms strive to maximize profits at minimum costs while the goal 
of consumers is to maximize their utility subject to their budget constraints. The 
fundamental concept by which mathematical economists explain the emergence of 
efficient states from the disparate choices of individual agents in economic systems is 
that of 'equilibrium', that is a state where all agents maximize their objective functions 
and the waste of resources - in the form of imbalance between demand and supply 
- is minimum (actually, zero). Typical results concern the existence and stability 
of equilibria for different types of economies (see below for a precise definition). In 
such settings it is however extremely difficult to extract meaningful macroscopic laws 
(comparable with empirical data) from the mathematical results, in great part because 
of the difficulties in handling agents' heterogeneity effectively. In what follows, we 
will show that when heterogeneity is taken properly into account the structure of 
equilibria of model economies (as well as of other related optimization problems of 
microeconomics) proves to be rich and non-trivial. We shall review the collective 
properties of a few exemplary linear optimization problems of microeconomics, whose 
setting will be borrowed from the economic literature [23]. We will see that the 
emerging scenario presents in all cases two distinct regimes: an expanding phase where 
technological innovations lead to an overall economic growth, and a saturated regime 
where growth is not achieved by technological innovation but rather by a diversification 
of the production. The key technical role in our analysis is played by the replica 
method and the transitions between expanding and contracting states can be completely 
characterized by a few macroscopic order parameters. Remarkably, it will turn out that 
the physical order parameters that arise bear an immediate economic interpretation. 

3.2. Meeting demands at minimum costs 

To begin with, we consider the simple linear model of production to meet demand 
satisfying an optimahty criterion [24, 25]. This illustrates the general two-phase 
phenomenology described above in an extremely simplified setting. Let there be N 
processes (or technologies) labeled by i and P commodities labeled by Each process 
allows the transformation of some commodities (inputs) into others (outputs) and is 
characterized by an input-output vector = {^f } where negative (positive) components 
represent inputs (outputs). Each process can be operated at any scale Sj > 0. The scales 
Si must be chosen so that the total amount of commodity /i that is produced (consumed) 
matches a fixed demand (availability): Sj^^^ = k'^ for all //., where the thresholds k'^ 
may be positive (for goods one wants to be produced) or negative (for goods to be 
consumed). Among all feasible states {sj}, one may select the one which minimizes a 
particular function of the Sj. Here we take the simplest choice of a linear combination 
X^i ^iPij which can be thought of as the total operating cost, if pi is seen as the operation 
cost at unit scale. 
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Figure 12. Fraction of active processes vs n for pi = 1. Inset: (/)n vs n for the 
same parameter values, s are taken to be Gaussian variables with variance 1/P 
such that = —0.001 for all i. k^'s are sampled from the bimodal distribution 

q{K) = ^5{k + 1) + ^5{k - 1) with m = 0.1 (from [25]). 

We ask the following question: how does the operation pattern (e.g. the fraction 
of active processes such that Si > 0) change when N increases, i.e. as more technologies 
become available? Indeed, the macroscopic structure of the efficient state must be 
expected to depend on the ratio N/P: ior N <^ P a, technology will be more likely 
to be active {si > 0) than for N ^ P, when selection will be stronger and processes 
performing the required conversions more efficiently will be favored. This problem can 
be tackled by methods of statistical mechanics in the limit N ^ oo with n = N/P finite 
upon assuming that the s arc quenched random variables (similarly to what has been 
done for other linear optimization problems such as the knapsack problem [26-28]). A 
further important requirement is that 'Y^^ii < for all i, which ensures that processes 
cannot be combined to yield a technology with only outputs. We refer the reader to [25] 
for details and focus on the emerging picture (see Fig. 12). One sees that for small n 
roughly a half of the processes are active. This means that as n increases, that is as 
more and more technologies become available, the number of active processes per good 
increases (see inset) i.e. the arrival of new technologies favors existing ones. The picture 
changes radically for n > 2, as starts to decrease and n(f) = 1. Now the number of active 
processes equals that of commodities and technologies undergo a much stronger selection 
which reduces the probability that a randomly drawn input-output vector is active. This 
simple model describes in a nutshell a transition to a highly competitive state where all 
possible productions are saturated by existing technologies and an increase in activity 
levels can be achieved only by increasing P. We shall see below that a similar picture 
extends to the more complicated case of general equilibrium. 
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3.3. Competitive equilibria of linear economies 

An economy can be seen as a complex system of interacting agents (consumers, firms, 
banks etc.) with conflicting goals and complementarities. It is indeed the heterogeneity 
of the agents which drives the economic process. Surely, if all agents were identical with 
identical endowments, there would be no trade. Modeling an economy as a system of 
heterogeneous agents is however a quite complex task [29]. In this section we review 
how statistical mechanics may be helpful in deriving the macroscopic properties of large 
random economies. Specifically this approach allows one to derive statistical laws that 
provide a picture of how structural properties are affected by changes of macroscopic 
parameters. This is the same type of information than random matrix theory provides 
about the structure of heavy nuclei [1,30]. 

3.3.1. Definition We stick to the standard microeconomic setup (see e.g. [23]). An 
economy is defined as a system of firms labeled by i, P commodities labeled by 
and L consumers labeled by £. Each firm is endowed with a technology that allows the 
transformation of some commodities, called 'inputs', into others, called 'outpTits'. Every 
technology is completely characterized by its 'input-output vector' = {^f }, where 
negative (respectively positive) components represent quantities of inputs (respectively 
outputs), and can be operated at any scale Sj > 0, meaning that when run at scale Si 
it produces or consumes a quantity Sj^f of commodity The price of commodities is 
given by the 'price vector' p — {p^ > 0}. Each consumer is characterized by his/her 
initial endowment of commodities — {y^ > 0} and by his/her utility function U^, 
associating to every bundle of goods x — {x^ > 0} a real number Ui{x) representing 
his/her degree of satisfaction. 

It is assumed that firms choose their activity levels Sj so as to maximize their profits 
TTi for a fixed price vector p: 

maxTTj with tTj — Si{p • (59) 

Si>0 

On the other hand, consumers choose their consumptions x^ so as to maximize their 
utilities within their budget constraints for a fixed price vector p: 

max Ue{x) with Bi — {x > s.t. p • y^> p • x} (60) 

Equilibria are states ({s^}, {x^},p*) for which (i) the above problems (59) and (60) are 
simultaneously solved for all i and i and (ii) the aggregate demand of each commodity 
matches the aggregate supply: 

J2{x^e-ye)^J2'i^^ (61) 

e i 

The 'market clearing' condition (61) implies zero waste of resources and ultimately 
determines the optimal price vector p*. 

In order to connect the microscopic efficiency to macroscopic laws, one would like 
to assess the typical values, relative fluctuations and distributions of consumptions. 
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operation scales and prices at equilibrium in a large heterogeneous economy, that is, 
when agents have different technologies, endowments etc. This problem can be tackled 
in its most general form by applying techniques of spin-glass physics. However, a rich 
qualitative description can be obtained already at a less general level, obtained by 
introducing the following assumptions [31,32]: 

a. Consumers: there is only one consumer (the 'society') whose utility function is 
separable: U{x) = '^^u{x^); the functions u are such that -u' > and u" < 

b. Initial endowments: the initial bundle y is a quenched random vector whose 
components are sampled independently for each n from a distribution p{y) 

c. Technologies: the input-output vectors have quenched random components 
that are identically distributed Gaussian random variables with zero mean and 
variance Aj/P satisfying ^^4^ = — with > 0; the quantities Aj are themselves 
quenched random numbers drawn from a distribution g{A) independently for each 



i and — 77-\/A^ 



Let us discuss them briefly. The assumption L — 1 simphfies the thermodynamic hmit 
considerably (in the most general setting, the latter corresponds to diverging N, P and 
L). The separability of U implies that commodities are a priori equivalent. Hence the 
society can increase its utility only by acquiring scarce commodities (ones with low y^^) 
at the expense of abundant commodities (with high y^^). Reaching non-trivial optimal 
states then requires that (i) some commodities are initially more abundant than others 
(one can see that no activity takes place in the case p{y) = 6{y — y)) and (ii) the 
productive sector is able to provide scarce goods using abundant goods as inputs. We 
will see that this last point constitutes a strong selection criterion for technologies. The 
convexity assumptions on u follow the economic literature and are convenient from an 
analytic viewpoint, as will become clear later. Finally, the assumptions on technologies 
guarantee, as in the previous case, that it is impossible to produce all commodities 
without consuming any by simply constructing a suitable combination of technologies 
(if this were possible, operation scales and consumptions would diverge while prices 
would vanish, a situation that is often described as the 'Land of Cockaigne'). We shall 
refer to the case r] ^ 0, which turns out to have special physical properties, as the limit 
of 'marginally efficient technologies'. 

We therefore have a deal of control parameters: A^, P, rj, g{A), u{x) and p{y)- In 
what follows we shall concentrate mostly on the role of rj and of the relative number of 
technologies N/P. In particular, we shall consider the 'thermodynamic hmit' N ^ oo 
with n = N/P finite. 

3.3.2. Statistical mechanics with a single consumer The problem of finding the 
equilibrium can easily be seen to be equivalent to calculating 





(62) 
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In fact, first, if (62) is solved then the society evidently maximizes its utility. On the 
other hand, producers also maximize profits since = ^^dxt^u = Xds^Hi. where the 
last equality follows from the fact that, by virtue of the budget constraint, d.xi^U = 
with A > a Lagrange multiplier. Thus prices disappear from the problem in explicit 
form. However a remarkable outcome of the statistical mechanics approach is that 
average prices and price fluctuations, like other relevant macroscopic observables, turn 
out to be directly connected to or easily derived from the spin-glass order parameters 
that emerge from the calculation, as we shall see later on. 

The statistical mechanics approach starts with the observation that if C/ is a 
sufficiently regular function one expects a self-averaging condition to hold, i.e. 



lim — max U \ y + Sj£,- ) = lim — / / max U \ y + Si$ 

N-.00 N {si>0} I ^ ^ N-^oo N \ \{si>Q} y ^ ^ 




(63) 

where ((■•■)) stands for an average over the quenched disorder {^j}: 

Now the right-hand side of the above expression can be evaluated by introducing the 
'partition function' 

Z = dx e^^(=^) ^ ds5ix-y-^Sii\ (65) 



/(/3)= lim J-((logZ)). (66) 





and defining the 'free energy' 

lim — 

iv^oo j3N 

As usual 

since in the limit (3 ^ oo configurations that maximize U give the dominant contribution 
to the partition function. The evaluation of / ultimately leads to the identification of a 
function G of a vector uj of macroscopic order parameters such that 

lim /(/?) = extr„G'(u;) (68) 

where extr means that the solution is provided by the saddle-point of G, that is by the 
vector uj* solving di^G — 0. The convexity assumptions made on u ensure that the 
relevant saddle point is of replica-symmetric form (as in (19)). Under this condition, w 
turns out to be a six-component vector (a? = {Q, 7, x, x, k, /$}) and G takes the form 

= 7:Qx -7r + + \ "^-^ 
2 2n n \ s>o 



(max u(x) ^ ( X — y + t\/nQ + k] 

n\x>o I ^ ' 2x V / 



tA 

(69) 



CONTENTS 



34 



where (■ • •)^ denotes an average over the random variable x, t is a unit Gaussian random 
variable and averages over A and y are performed with distributions 5'(A) and piy). 
Before discussing the economic interpretation of the order parameters let us notice that 
G is composed of two "representative agent" problems: 

• an 'effective profit' maximization by a representative firm, whose solution reads 

{ta — rjR ^ 
for t > nn/a 
xVA (70) 
otherwise 

where we defined a — — ^ 

• an 'effective utility' maximization by the society with respect to the consumption 
of an effective commodity, whose solution, namely 

X* = x*(t, y) such that xu!{x*) = x* - y + t^/nQ + k (71) 

is always positive provided the assumptions on u arc satisfied 

These two 'effective' problems - which have been derived and not postulated a priori - 
are interconnected by the remaining terms. 

The saddle-point equations ^^JG = for (69) have the following form: 

Q = {^isr\^ (72) 

X = - (ts^^/A) (73) 

a \ / t,A 



K = XK- + nri(^s*VAj ^ (74) 
K = {u 'jx*)),^^ (75) 
{u'{x*y),^^ - {u'{x%^ (76) 

x--^rW^ (77) 

One sees immediately that k represents the optimal average (relative) price. In fact, 
utility maximization under budget constraint gives dx^U — Xp^, with A > a Lagrange 
multiplier that can be set to 1 without any loss of generality. It then follows that a yields 
price fluctuations. It is remarkable that the macroscopic order parameters introduced 
with a purely 'physical' method can be seen to possess such clear economic meanings. It 
is also remarkable that the following laws can be derived, with minimal manipulations, 
from the above set of equations: 

{x* - = -nr) (s*^)^^^ (78) 

{u'{x^) (x* - ^)),^ = ' (79) 

The former expresses the fact that at the relevant saddle point the market-clearing 
condition is satisfied (to compare, just average (61) for L = 1 over /x taking the constraint 
on technologies into account). The latter expresses the fact that at the relevant saddle 
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point the consumer saturates his/her budget when choosing his consumption, a condition 
known in economics as Wahas' law [22]. 

It is possible to obtain a more precise characterization of the macroscopic 
properties by calculating the distribution of operation scales, consumptions and prices 
at equilibrium. These quantities are given respectively by 

poo 

P{s) = {5{s - 0>,A = / ^(A)P(5|A)dA (80) 

Jo 

POO 

P{x) = {d{x - x*)),^^ = J p{y)P{x\y)dy (81) 
P{p) = {5ip - u'{x*))),^^ ' (82) 

where P(s|A) and P{x\y) denote respectively the probability distributions of operation 
scales at fixed A and of consumptions at fixed y. These can be calculated easily from 
(70) and (71). One finds 

P{s\A) = (1 - + ^=|e ^e{s) (83) 

, , l — yu"(x) (x-y-xu'(x)+Kf 

P(x\y) - -^==^e ^ (84) 

where (f) — |[1 — ^rf^^] is the fraction of active firms (i.e. firms such that Sj > 0). 
Moreover, notice that P{s\A) = y/AP{sy/A\l), which implies that 

P(s) = - / eg{k^/s^)P{k\l)dk (85) 
■5 Jo 

Thus, power-law distributed operation scales are found for broad classes of distributions 
(^(A), as P{s) oc when g{A) ~ A^ for A ^ 1. Recently, some empirical evidence 

has been found that distributions of firm sizes (defined by the number of employees, 
profits etc.) have scahng forms [33]. 

Numerical solution of the saddle-point equations for a generic choice of the 
parameters yields the picture illustrated in Figures 13 and 14. The quantity is shown 
in Fig. 13 against numerical simulations. One sees that there are two regimes: one 
where ~ 1/2 for small n, and a second one for large n where n(/) ^ 1, so that the 
number of active firms equals that of commodities, signaling a saturated market. Fig. 14, 
instead, shows that the average scale of production increases when n grows as long as n 
is sufficiently small. This means that the introduction of new technologies (i.e. from an 
increase of n) leads to an increased production activity of existing firms if the number of 
competitors is low. In parallel, relative price fluctuations decrease, as does the average 
level of consumption, signaling that flrms are managing to transform abundant goods 
into scarce ones. When n is close to 2, operation scales become larger and larger as 77 
decreases (i.e. as technologies become more and more efficient) and ultimately develop a 
singularity at ric in the marginally efficient limit 77 — > (see below for more details about 
this limit). The fluctuations of relative consumptions start to drop (the sharper the 
lower is 77), as the distribution of consumptions becomes more and more peaked around 
the mean value. Identifying abundant (or scarce) goods becomes increasingly hard. In 
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Figure 13. Behavior of net) {(p = fraction of active companies) at equilibrium as 
a function of n for r/ = 0.05: analytical prediction (continuous line), computer 
experiments with P = 16 (dotted line) and for P — 32 (dashed line) averaged over 
100 disorder samples. Dots represent results of a single realization of the technologies. 
Inset: </> vs n for 77 = 0.01,0.05,0.1,0.5 (top to bottom). From [31]. 




Figure 14. Typical macroscopic properties of competitive equilibria for g{A) = 
S{A — 1), u{x) = logx and p{y) — e~y . Left panels: typical operation scale (top) 
and relative price fluctuations at equilibrium for different values of rj. Right panels: 
typical consumption and relative consumption fluctuations for different values of rj 
(from [31]). 

high n regime, the introduction of new technologies, by e.g. technological innovation 
(iV — >• + 1), leads to a decrease in the average operation scale, i.e. new profitable 
technologies punish existing ones. The economy becomes strongly selective as firms 
cannot take advantage of the spread between scarce and abundant goods any longer. 
On the other hand, the average consumption starts growing with n, as is expected 
in a competitive economy that selects highly efficient technologies. In this phase the 
introduction of new commodities (an increase in P) leads to an increase in the scale of 
operations. 
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The above results confirm rather clearly that the collective properties of competitive 
equilibria display a marked qualitative change when n increases, as one passes from an 
expanding to a saturated regime around n ~ 2. Such a change is a smooth crossover 
for any finite > 0. However, in the limit ^ in which technologies are 'marginally 
efficient' the crossover becomes a sharp second-order phase transition characterized by 
the fact that = 1/2 for n < ric and 4> < 1/2 otherwise, whereas 

(s'') ~ |n - nd"^/^ |n-nc|<l (86) 

(see [31] for analytical details). This can be explained intuitively by a simple geometric 
argument. Let us write the initial endowments as y'^ = y + 5y^, separating a constant 
part {if) from a fiuctuating part (5?/^) such that Yl^^V^ — 0- Now market clearing with 
r) = implies that V = - Sy, so that all the transformations take place in the space 
orthogonal to the constant vector. This means that those technologies with ■ <5^/ < 
which reduce the initial spread of endowments Sxq lead to a increase in wealth and 
hence will be run at a positive scale. Those with a positive component along Sxq will 
have Si — 0. Given that the probability to generate randomly a vector in the half-space 
■ < is 1/2, when N is large wc expect N/2 active firms. Still the number of 
possible active firms is bounded above by P, hence when n = N/P = 2 the space of 
technologies becomes complete and = y for all fi. There is no possibility to increase 
welfare further. 



3.3.3. Case of many consumers In the model just described, there are A'^ firms running 
linear activities , which are vectors in a P-dimensional commodity space, at a scale 
Sj > 0. These firms face a demand function Q^{p) from consumers, which is the quantity 
that consumers will buy at prices p^. The profit of firm i is given by tt^ = Sj Y^^=iP^Qi ■ 
Let us consider a more general case. Let us assume there are L consumers, each 
with an initial endowment y^ of commodity /i and each taking a share 9u in the profit of 
firm i. We assume that consumers face fixed prices p^. So the initial wealth of consumer 
£ = 1, . . . , L is 

P N 

w, = Y,p^yt + Y.^it^^ (87) 

//=1 i=l 

If consumers are identical, apart from the initial endowments, and aim at maximizing a 
utility function U [x) — Y^^=i log as before, the solution is relatively straightforward: 
the problem of consumer £ is solved by 

(88) 

(i.e. each consumer distributes his wealth uniformly over commodities, taking prices 
into account). Now the total demand function will be 

(.=1 ^ e=i 
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In a pure exchange economy (without production: Sj = Vi) the above quantity will 
equal to total initial endowment of each commodity, i.e. 

L 

If are drawn independently at random with mean y and variance L>, then will have 
mean Ly and variance LD and the relative fluctuations of the total initial endowments 
will be = \fD / {yfLy) , which decreases as L increases. When we allow firms to 
operate (sj > 0), relative fluctuations in the demand must be expected to be of the same 
order 

Therefore, by equation (89), relative price fluctuations will also be of the order 
This simple argument explains how the different macroscopic quantities re-scale in the 
presence of L consumers when L — > oo. We remark that Ref. [31] shows that the 
scales of production have a non-trivial behavior in the limit of extremely uniform initial 
endowments, which suggest an essential singularity (s) ~ exp(— c/-\/L) as L — > oo. 

The case of consumers with different utility functions requires a more involved 
approach, because the heterogeneity of consumer utility is likely to imply a non- 
symmetric demand function (even when prices are all equal). Apart from this, it 
is reasonable to expect that the basic insights gained from the above analysis, such as 
the presence of a cross-over between two structurally different phases of the economy, 
will remain valid. 



3.4. Economic growth: the Von Neumann problem 

Von Neumann's expanding model addresses the issue of computing the maximum 
achievable growth rate of a linear production economy [34]. Economic growth is seen 
basically as an autocatalytic chemical process in which technologies play the role of 
reactions and commodities of reactants. In spite of its extremely simple setup, the model 
has played a key role in the mathematical theory of economic growth, particularly in 
view of its connection to dynamical growth via the so-called turnpike theorems [35]. 

The time-dependent model is defined as follows. One considers an economy with 
P commodities (labeled ^) and linear technologies (labeled i), each of which can 
be operated at a non-negative scale Si > and is characterized by an output vector 
ttj = {af } and by an input vector 6j = {6f }, such that ^jaf (respectively Sibf) denotes 
the units of commodity /i produced (respectively used) by process i when run at scale 
Si. It is assumed that input/output vectors are fixed in time and that operation scales 
are the degrees of freedom to be set, for instance, by firms. At time t, the economy 
is characterized by an aggregate input vector I{t) — ^iSi{t)bi and output vector 
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0{t) — 'Y^. Si{t)ai. Part of the latter will be used as the input at period t + 1 whereas 
the rest, namely 

C{t) = 0{t) - I{t + 1) (92) 

is consumed. In absence of external sources, in order to ensure stability it is reasonable 
to require that inputs at any time do not exceed the outputs at the previous time, i.e. 
one must have C^{t) > for all /i at all times. Let us focus on solutions in which input 
vectors grow in time at a constant rate, i.e. of the form I{t + 1) = pl{t) with p > a 
constant (the growth rate). For these solution, the scales of production have the form 
Si(t) = Sip*', and likewise C{t) — cp^. Therefore the stability condition can be re-cast 
in the form 

c^ = J2si{a^-pK)>0 yp (93) 

i 

The (technological) expansion problem amounts to calculating the maximum p > 
such that a configuration s = {si > 0} satisfying the above condition exists (it is 
easy to show that such an optimal growth rate exists [24]). In such a configuration 
the aggregate output of each commodity is at least p times its aggregate input. If the 
maximum p, which we denote by p*, is larger than 1 the economy is 'expanding', whereas 
it is 'contracting' for p* < 1. On the other hand, the actual value of p* is expected to 
depend on the input and output matrices. Intuitively, p* should increase with the 
number of technologies and decrease when the economy is required to produce a 
larger number P of goods. 

In [36] this problem was tacked in the limit N ^ oo with n = N/P finite 
under the assumption that (af , h^) are independent and identically distributed quenched 
random variables for each i and p, with the aim of uncovering the emerging collective 
properties that are typical of large random realizations of a complex wiring of input- 
output relationship. To begin with, let us write of = a{l-\-a'^) and &f = 6(l-|-/3f ), where 
a and h are positive constants while af , /3f are zero- average quenched random variables. 
Inserting these into (93) one easily sees that, to leading order in N , the optimal growth 
rate p* is given by the ratio a/h of the average output and average input coefficients, 
hence it is independent of the specific input-output network. The non trivial aspects of 
the problem are related to the corrections to the leading part. We therefore write the 
growth rate as 



so that, assuming a = h for simplicity, (93) becomes 



^ , . , ^ , > Vp (95) 

The problem thus reduces to that of finding the largest value g* of g for which it is 
possible to find coefficients (sj > 0} satisfying (95). In the limit N ^ oo one may 
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Figure 15. Behavior of g* /Vkn vs n. Inset: and Vo (related by (100)) vs n 
(from [36]). 



resort to a Gardner-type calculus [37]. Defining the characteristic function 



x{s) = \{e 



1 



af - 



AT 



- 1 + 



(96) 



one can write the typical volume of configuration space occupied by micro-states 
satisfying (93) lor N ^ oo at fixed g is given by 

1 



Kyp(^?)= lim i\ogV{g))) 

N^oo iV 



(97) 



(98) 



where V{g) is the volume of solutions at fixed disorder: 

V{g)= j\{s)6 {y.s,-N^ ds 

(without affecting the optimal growth rate, we introduced a linear constraint Sj = 
N). It is reasonable to expect that, when g increases, T4yp(fl') shrinks, and in particular 
that Vtyp{g) lor g ^ g*. Now after carrying out the disorder average (see [36] for 
details), which only depends on 



k 



(99) 



the key macroscopic order parameters turns out to be the overlap qw = {l/N) susn' 
between different optimal configurations i and £'. Because the space of solutions 
{si} is a convex set (by construction), the replica-symmetric approximation, for which 
Qw — Q + X^w is in this case exact. Note that x, which describes the fluctuations of Sj 
among feasible solutions, should also vanish iis g ^ g*, hence the conditions g = g* and 
X = are equivalent and the analysis of optimal states coincides with the study of the 
X — > limit of the replica-symmetric solution. 

Results for the re-scaled quantity g* / \fnk are shown in Fig. 15. The line separates 
the region of feasible solutions with g < g* from the region of unfeasible solutions, g* 
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crosses the line g = Q (i.e. passes from a regime with growth rate p < a/h to one with 
growth rate p > a/h) at Uc = 1. In the inset wc show the fraction of inactive processes 
V^o (i.e. such that Sj = 0) and that of intermediate commodities (po (i.e. such that 
= 0) at (? = g*, as a function of n. These are found to be universal functions of n 
independent of the details of the disorder distribution, related by 

00 = n{l - ^o) (100) 

Both 00 and i/jq tend to one when n increases, meaning that the 'expanding phase' at 
n > Uc is highly selective. The condition (100) has a simple geometrical interpretation: 
it implies that the number of active processes equals that of intermediate commodities 
at g*. Noting that for any p such that = we have a linear equation for the scales 
Si > 0, we see also that (100) simply corresponds to the requirement that the number 
of equations should match the number of variables. 

Based on these results one can speculate on how long term growth will be affected 
by technological innovation. The latter, defined as the introduction of new processes, 
i.e. new feasible ways of combining inputs to produce desirable outputs [38] would 
just correspond to an increase in the number of transformation processes which the 
economy has at its disposal. Now the change in the growth rate is related to the change 
in gf*/-^/n, which is given by 

Therefore an increase in N can have a large positive impact on long term growth when 
n is small. For technologically mature economies (n 3> 1) instead, g*/\/n increases 
much more slowly, hence technological innovation has much smaller effect on long term 
growth. 



4. Toy models of financial markets: Minority Games 

4.1. Introduction 

The Minority Game (MG for short) [39] is a strict relative of the El Parol problem (it 
corresponds roughly to the case L = N/2) that has been proposed to model speculative 
trading in financial markets, that is systems where agents buy and sell asset shares 
with the only goal of profiting from price fiuctuations. The basic idea is that when most 
traders are buying it is profitable to sell and vice- versa, so that it is always convenient to 
be in the minority group. Abstracting, one considers the following situation. We have N 
agents, each of which has to formulate at every time step t a binary bid 6j(t) e {—1, 1} 
(buy/sell). The payoff received at time t by each agent depends both on his/her action 
and on the aggregate action A{t) — J^i^ii^) (^^^ 'excess demand') and it is given by 
7rj(i) = —bi{t)A(t). Thus, agents in the minority group win. The minimal measures of 
efficiency to be employed are the average excess demand and fluctuations in the steady 
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state: 

(A)= lim TfA7f^Y.A{t) and a' = {A') (102) 

t — J eq 

where Teq is an equilibration time. An efficient state is one wfiere {A) — and cr^ is 
small. Notice that the number of people which could have been accommodated in the 
minority is \A\/2, hence cr is a measure of the waste of resource. What remains to be 
specified is how agents make their decisions. Agents who buy or sell at random with 
equal probability at every time step lead to a state where {A) = and = N. Of 
course, it is the way in which agents take their decisions (which needs to be specified) 
and their interactions that gives rise to the complex collective behavior. 

The MG is a useful toy model that allows to elucidate the collective behavior of 
systems of heterogeneous interacting agents by addressing directly the interplay between 
microscopic behavior and macroscopic properties (fluctuations, predictability, efficiency, 
etc.). Prom a purely theoretical viewpoint, the detailed study of the emergence of 
cooperation in competitive systems makes the Minority Game a benchmark model of 
interacting agents. It has however also turned out to be able to reproduce, to some 
extent, the rich statistical phenomenology of financial markets, that are well known (and 
at least since [40]) to be characterized by clear statistical regularities, often referred to 
as "stylized facts" §. 

There are at present a few comprehensive books that cover many aspects of the MG, 
from both the theoretical viewpoint and the financial market viewpoint [42,43]. Here 
we shall consider some basic and extended aspects of the model that are only marginally 
treated elsewhere. In this section we shall concentrate mainly on the original model, 
first presenting a more thorough derivation of the minority rule, then a simple version of 
the MG and finally discussing the standard model. The next section is instead devoted 
to some extensions that have a particularly interesting physical content. 



4-2. From agents' expectations to the minority (and majority) rule 

The connection between MGs and financial markets can be established naively by 
observing that markets arc instruments for allocating goods. This, combined with the no 
arbitrage hypothesis according to which no purchase or sale by itself may result in a risk- 
less profit, suggests that markets should in principle be zero-sum games. Transaction 
costs make it a game that is unfavorable on average, i.e. a Minority Game. It would 
however be important to understand whether the minority mechanism can be derived 
from a particular microscopic scheme. This is indeed possible [44] . 

§ An ever increasing number of such facts are documented in the hterature. The best known of these 

arc the following: (a) asset returns arc approximately uncorrclatcd beyond a time scale or the order of 
tens of minutes; (b) the unconditional distribution of returns displays a power-law tail with an exponent 
ranging from 2 to 4 for different stocks and markets; (c) the distribution of returns over a time scale 
T becomes more and more Gaussian as t increases; (d) volatility is positively autocorrelated over 
time scales as long as several days, implying that periods of high volatility cluster in time ('volatility 
clustering'). See [41] for details. 
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Let us imagine a market in which N agents submit their orders ai{t) for a certain 
asset simultaneously at every time step t = 1,2,.... Let aj(t) > mean that agent 
i contributes ai(t)'€. to the demand for the asset while aj(t) < means that i sells 
—ai{t)/p{t — 1) units of asset, which is the current equivalent (i.e. at price p{t — 1)) of 
\ai{t)\€. With ai{t) = ±1 and A{t) = the demand is given by D{t) = 

whereas the supply is S{t) = 2p(t^^i) ■ Filially, assume that the price is fixed by the 
market clearing condition, p{t) — D{t)/S{t), i.e. 

,N + A(t) 

Taking the logarithm of both sides and expanding to the leading order one gets 

logp(i) - logp{t - 1) ~ (104) 

A 

with X — N. The quantity on the left-hand side is normally called the 'return' of the 
asset. A{t) is instead the excess demand, namely the difference between demand and 
supply. This equation expresses the dynamics of prices in terms of an aggregate quantity 
A{t) that all agents contribute to form [45]. A{t) may thus be considered a proxy for 
the return. 

Now take agent i and assume he must decide whether to buy or sell at time t. To 
do this, he should compare the expected profit (or utility) of the two actions, which 
depends on what the price will be at time t + 1. For instance the utility he would face 
at time i + 1 if he buys 1€ of asset at time t (i.e. ai{t) — 1) is given by 

U^{t) = - 1 (105) 

{ui{t) > if p{t + 1) > p{t)). At this stage the price p{t + 1) is unknown to him (and 
presumably to everybody else). Therefore if our agent i wants to use Eq. (105) to make 
his choice at time t. he has to replace p(t+l) by the expectation he has at time t of what 
the price will be at time t + 1, denoted by M^ ^\p{t + 1)]. Let us assume that that [44] 

E«[p(i + 1)] = (1 - xlj,)p{t) + i^ip{t - 1) (106) 

The parameter ■^j allows to distinguish two types of traders, depending on whether ■^j 
is positive or negative. Agents with V'i > believe that market prices fluctuate around 
a fixed value (the 'fundamental'), so that the future price is an average of past prices. 
For this reason these agents arc called 'fundamentalists'. They may also be called 
contrarians since they believe that the future price increment Ap(i-|- 1) = p{t + l) —p{t) 
is negatively correlated with the last one 

Ef\^p{t + l)]^-'il^Ap{t). (107) 

On the other hand, if ■0i < the agent believes that the future price increment will 
occur in the direction of the trend defined by the last two prices, so that future price 
increments Ap(t + 1) are positively correlated with the past ones, as if the price were 
following a monotonic trend. This type of agents are called 'trend followers'. 
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The expected utility for buying at time t will be W^' \ui{t)\ai{t) — +1] = —'^i\p{t) — 
p{t — l)]/p{t) which, using (103), becomes 

E«K(i)|a,(t) = +1] = -2il;iA{t)/[N + A{t)] (108) 

A similar calculation can be carried out for the expected utility for selling at time t. 
The net result is that the expected utility for action ai{t) at time t can be written as 

Notice that agents who took the majority action ai{t) — sign[A(t)] expect to receive 
a payoff —2tlji\A{t)\/[N + \A{t)\] whereas agents in the minority group expect to get 
2ipi\A{t)\/[N — \A{t)\]. It is clear that the expected payoff of fundamentalists (resp. 
trend-followers) is positive when they are in the minority (resp. majority) group. 
Therefore Minority Games are simple schemes for describing the behavior of contrarians 
whereas Majority Games are appropriate for trend-followers. 

In real markets, both groups are present and the resulting price dynamics stems 
from a competition between the two groups [46]. Which group dominates and shapes 
the price dynamics depends on the evolution of traders' expectations, which in turn 
depends on the behavior of price itself. Common sense suggests that when everybody is 
going to buy the price will rise and it will be convenient to buy. Accordingly, speculative 
markets in certain regimes (e.g. bubbles) should look more like Majority Games rather 
than Minority Games (and vice- versa in other regimes). If all traders base themselves on 
the same price history, expectations should converge and traders would end up playing 
either a Majority or a Minority Game. But of course agents revise and calibrate their 
expectations according to the real price history so fundamentalists and trend-followers 
coexist symbiotically in real markets. The problems with arguments in support of either 
the Minority or the Majority Game essentially arise from the fact that the objective 
assessment of the validity of a trading strategy is a complex inter-temporal problem that 
cannot be based on the result of a single transaction: whether buying today is profitable 
or not depends on what the price will be when one sells. Hence the payoff of a single 
transaction is hardly a meaningful concept unless one considers round-trip (buy/sell or 
sell/buy) transactions. From this point of view the MG is a rather crude approximation. 
Yet, we shall see below that it provides a remarkably rich and realistic picture of financial 
markets as complex adaptive systems. Models of interacting fundamentalists and trend- 
followers will be addressed in the following section. 

4-3. The simplest Minority Game 

Before considering the model in its full complexity, it is instructive to to take a glimpse 
at a minimal version with inductive agents in which the collective behavior can be 
easily understood with simple mathematics [44]. Let us suppose that traders employ a 
probabilistic rule of the form 

Prob{6i(t) = b} = C{t) exp [bAi{t)] b e {-1, 1} (110) 
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where C{t) is a normalization factor and Aj(t) accounts for the agent's expectations 
about what will be the winning action (if Aj(t) > then he/she will choose hi{t) — 1 
with higher probability). The 'score function' Aj is updated according to 

A,(i + 1) - A,(i) = -VA{t)/N (111) 

with r > a constant, so that if A{t) < agents increase Aj and the probability of 
choosing action 1. Let us finally assume that the initial conditions Ai(0) are drawn 
from a distribution po(A) with standard deviation s. How does the collective behavior 
depend on the parameters V and si 

Notice that y{t) = Aj(t) — Aj(0) does not depend on i, for all times. For ^ 1, 
the law of large numbers allows us to approximate A{t) by its average with probability 
distribution (110). This yields an approximate dynamical equation for y{t): 

y{t + 1) ~ y{t) - r (tanh[y(i) + A(0)])o (112) 

where the average (. . .)q is on the distribution po of initial conditions. Eq. (112) admits 
a fixed point y{t) = y*, with y* the solution of (tanhfy* + A(0)])q = (A) = 0. Let us 
assume that this solution is stable. This describes a stationary state where the relative 
scores Aj(t) are displaced by a quantity y* from the initial conditions. This gives 

N 
i=l 

Notice that a'^ oc N and it decreases with the spread of the distribution of initial 
conditions. A hnear stabihty analysis of Eq. (112) shows that these solutions are stable 
only when 

2 2N 

r < r = = = — fii4) 

' 1 - (tanh[|/* + A(0)]2)o ' ^ ' 

When r > Fc one finds periodic solutions of the form y{t) = y* + z*(— 1)* where y* and 
z* satisfy certain prescribed conditions. The parameter z* plays the role of an order 
parameter of the transition at Fc {z* = for F < Fc). Again we have (A) = 0, but now 

^2 ^ ^^.2 (tanh[y- + z* + A{0)])1 + (tanh[y- - z* + A(0)])g ^^^^^ 

i.e. fiuctuations arc proportional to A^^. Hence this is a much less efficient state. The 
orbits of the dynamics of y{t) for F < F^, and F > F^ are shown in Fig. 16 together with 
the behavior of a^/N^. We conclude that the more heterogeneous the initial condition 
is, the more efficient is the final state and the more the fixed point y* is stable. The 
transition from a state where a'^ ex N to a, state with cr^ oc N"^ will turn out to be a 
generic feature of MGs. 

4.4- The Minority Game 

In the simple case discussed above, agents base their choice only on their past experience. 
The standard Minority Game describes a more general situation in which traders use 
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Figure 16. Left panels: the map y{t) for T = 1.8 < Yc and T = 2.5 > Vc for s = 0. 
Right panel: global efficiency a'^ /N"^ as a function of F for two different sets of initial 
conditions: Ai(0) is drawn from a Gaussian distribution with variance . The full 
line corresponds to s = 1/2 whereas the dashed line is the result for s = 1. The inset 
reports the critical learning rate Fc as a function of the spread s of initial conditions. 

both their past experience and some (endogenous or exogenous) information pattern. 
The model is defined as follows [47]. There are N agents labeled i. At each time 
step t agents receive one of P possible information patterns ^{t) (whose precise nature 
will be discussed below) based on which each trader must formulate a binary bid 
hi{t) G { — 1,1}. To this aim, each of them is endowed with 5* strategics a^g = {af^} 
{g = 1, . . . , S*) that map informations fi G {1, . . . , P} into actions af^ G { — 1,1}. Each 
component a^^ of every strategy is selected randomly and independently from { — 1,1} 
with equal probability for every i, g and n at time t = and is kept fixed throughout the 
game. Agents keep tracks of the performance of their strategies by means of valuations 
functions or scores Uig that are initialized at some value Uig{0) and whose dynamics 
reads 

U,g{t + 1) - U,g{t) = -a^^'^A{t)/N (116) 

where A{t) = 6j(t) is the excess demand at time t. At each round, every agent picks 
the strategy gi{t) = arg maxgUig{t) carrying the highest valuation and formulates the 
corresponding bid: bi{t) = a^g^Jly In this way, agents adopt at each time the strategy 
they expect to deliver the highest profit (the score of strategies forecasting the correct 
minority action increase in time). 

The nature of the information patterns ^{t) is still to be specified. In principle, the 
natural choice corresponds to taking the string of the past m minority actions (hence 
P = 2™) as the information fed to agents at every time step, with the idea to describe a 
closed system where agents process and react to an information they produce themselves 
collectively. We refer to this choice as the case of endogenous information. On the other 
hand, one may think of replacing for the sake of simplicity the above information (which 
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has a non-trivial dynamics itself) with an integer drawn at random at each time step 
from {1,...,P} with uniform probability. This corresponds to the case of random 
exogenous information [48]. Again, this replacement induces a major simplification in 
the structure of the model by turning a complex non-Markovian system with feedback 
into a Markovian one. In addition and at odds with the El Parol problem, it was shown 
that collective properties are roughly unaffected when real information is substituted 
with random information. These results suggest that, to some extent, the feedback is 
irrelevant as far as collective properties are concerned. We shall hence focus on the case 
of exogenous information for the following sections. A more careful discussion of the 
subtle case of endogenous information will be deferred to Sec. 4.8. In summary, the 
Minority Game is completely defined by the following rules: 

gi{t) = arg maxC/ig(i) 

9 

i 

Ui,{t + 1) - U,,{t) = -4'^A{t)lN 

Let us now discuss the macroscopic properties of the model. Early works focused 
on the cooperative properties of the system in the stationary state. The central quantity 
of interest is the numerical difference between buyers and sellers at each time step, A{t). 
It is easy to anticipate that none of the two actions —1 and 1 will systematically be the 
minority one, i.e. that A{t) will fluctuate around zero. Were it not so, agents could easily 
improve their scores by adopting that strategy which visits most often that side. The size 
of fluctuations of A(t), instead, displays a remarkable non-trivial behavior. The variance 
cr^ = {A^) of A{t) in the stationary state measures the efficiency with which resources are 
distributed, since the smaller cr^, the larger a typical minority group is. In other words 
(7^ is a reciprocal measure of the global efficiency of the system. Early numerical studies 
have shown that the relevant control parameter of the model is the relative number of 
information patterns a — P/N. The behavior cr^ is illustrated in Fig. 17. With a fixed, 
one typically observes that A{t) ~ or equivalently that cr^ = 0{N). When a ^ 1 
the information space is too wide to allow for a coordination and agents essentially 
behave randomly as a'^/N ~ 1, the value corresponding to random traders. As a 
decreases, that is as more and more agents join the game or as the possible number of 
information patterns decreases, a"^ /N decreases suggesting that agents manage to exploit 
the information in order to coordinate to a state with better-than-random fiuctuations. 
It turns out that these steady states are ergodic, that is they are reached independently 
of the initial conditions Uig{Qi). Lowering a further, ergodicity is lost and the steady 
state depends on C/jg(0). For the so-called flat initial conditions, C/ig(0) = for all i and 
which describe agents with no a priori bias toward one of their strategies, one is driven 
into highly inefficient steady states where cr^ diverges as a decreases approximately as 
0"^ ~ 1/a. Notice that this implies o"^ ~ A^^. This behavior for has been attributed 
to the occurrence of "crowd effects" . Remarkably this ergodicity breaking transition is 
related to a phase transition with symmetry breaking that was first discovered by Savit 
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Figure 17. Behavior of /N and H/N versus a (analytical and numerical) for 
different initial conditions y{Q) = J7ii(0) — f/i2(0). 



and coworkers [49] for the case of endogenous information. Reporting the frequency 
with which the minority action was 1 conditional on the value of they observed that 
for a <^ 1 the minority was falling on either side with equal probability irrespective of /x. 
But when a ^ 1 the minority happened to be more likely on one side, depending on the 
value of These observations have been sharpened in a study that allowed to locate 
the phase transition at the point «c — 0.34 for S* = 2 where o"^ attains its minimum 
(see next section for details). The transition separates a symmetric {a < from an 
asymmetric phase (a > etc)- The symmetry which is broken is that of the average of 
A(t) conditional on the history ji, {A\ij). The idea is that if {A\ij) ^ for a certain 
^ then the knowledge of alone suffices for a non-trivial statistical prediction of the 
sign of A{t). In the asymmetric phase, ^ for at least one ji. Thus the sign of 

A{t) is predictable, to some extent, on the basis of ji alone. A measure of the degree of 
predictability is given by the function 

(118) 

(1=1 

In the symmetric phase {A\ii) = for all /j, and hence H = 0. if is a decreasing function 
of the number N of agents (at fixed P): newcomers exploit the predictability of A{t) 
and hence reduce it. The behavior of H is also reported in Fig. 17. Notice that it acts 
like a 'physical' order parameter. 



4-5. Statistical mechanics of the MG: static approach 

We shall discuss in this review two fines along which the statistical mechanics of the 
Minority Game with random external information can be studied. The first one is 
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a static theory whose crucial steps are (a) finding a (random) Lyapunov function of 
the dynamics that allows one to identify the steady states of the learning process 
with its minima; (b) calculating the latter via the replica method. The second one 
consists in constructing a dynamical mean-field theory using the learning dynamics 
as a starting point. The two approaches are essentially complementary: the statics 
gives more information about the predictabihty and allows to interpret the collective 
properties in terms of a minimized quantity; the dynamics focuses on ergodicity and 
is a more appropriate setting to discuss fluctuations. Below we will outline the static 
approach to the standard MG for the case S — 2, deferring a discussion of the dynamical 
method to Sec. 5.3. Other possibilities, like the 'crowd-anticrowd' theory [50] will not 
be discussed here (an account can be found in [6]). 

It is helpful for a start to introduce the auxiliary variables [47] 

aii-ai2 aii + ai2 Uii{t)-Ui2{t) 

^ , 2 ' 2 ^ ^ 

in terms of which (116) can be re-cast as 

y^{t + l)-yi{t)^-^^f^A{t) (120) 

The advantage lies in the fact that the dependence of aig^(^t) on the strategy valuation 
can be made exphcit by noticing that gi{t) — Hi yi{t) > and gi{t) = 2 if yi{t) < (we 
shall therefore refer to as the 'preference' of agent i). As a consequence, the relevant 
microscopic dynamical variable is the Ising spin Si{t) — sign[yj(i)]. On has in particular 

aig,(t) = + Si{t)^i (121) 

i i 

The dynamics (120) is non-linear in a way that doesn't allow to write it in the form of a 
gradient descent. However, as in the El Parol problem, one may regularize the dynamics 
via a learning rate F > such that [51] 

Proh{gi{t) = ^} = C{t) e^^'«(*) C{t) = normahzation (123) 

It is then possible to construct the continuous-time hmit of (120) in view of the fact 
that the dynamics possesses a 'natural' characteristic time scale given by P. Proceeding 
as shown for the El Parol case, one arrives at the following continuous-time Langevin 
process [16]: 

Viir) = -pi - ^e^tanh[y,(T)] + Zi{r) (124) 

j 

{z,{r)z,{T'))c^^M'AT-T') (125) 

where r = Vt/P is a rc-scaled time and cr^ is the volatility|| and the over-line denotes 
an average over ji. One sees that in the limit P ^ 0, in which the dynamics becomes 

II Eq. (125) is based on a time-independent volatility approximation which happens to be very well 
satisfied away from the critical line. We refer the interested reader to [16] for further details. 
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deterministic, the system performs a gradient descent with a well-defined Hamiltonian. 
Indeed, in order to extract the steady state from the above process, one may take its 
time average: 

{Vi) = -e^n - J^e^m,-, rui = (tanh(yO) e [-1, 1] (126) 

i 

It is now clear that the stationary values of the variables rrii can be obtained from the 
minimization of 

2 

_ (127) 

which coincides with the predictability in the steady state. Hence agents coordinate so 
as to make the market as unpredictable as possible. This conclusion remains correct 
even for F > 0: indeed are still given by the minima of though the dynamics is 
no more deterministic (see [16]). Actually, within the approximation of Eq. (125), it 
can be shown (see Sec. 4.6) that for a> ac the steady state is independent of F. 
As usual, minimization of H is achieved through the replica trick as 

lim — ((min— \\ = lim lim lim ^— log (( [Tr^e-^^D) (128) 

The calculation is detailed at length in the literature (see e.g. [52]). The resulting phase 
structure is as follows: 

• for a larger than a critical value etc = 0.3374 . . . there is a unique (F-independent) 
minimum with H > 

• for a < a^, there is a continuous of minima where H vanishes. The minimum 
selected by the dynamics depends on initial conditions (and on F) 

Hence the system at a^. undergoes a phase transition from a predictable to an 
unpredictable phase. Such a static transition corresponds to a dynamical instability in 
the dynamics of preferences for F = 0. To see this, let us first mention that in numerical 
simulations one observes that Ui either grows linearly with time or stays finite. Based 
on this, one can conclude that solutions of (126) are of the form (yj) = Vit, with 

Vi^-'^-Y.^j^j (129) 
j 

and that there are two possibilities: 

• either Vi ^ and yi{t) diverges as t ^ oo, in which case = sign{vi) the agent 
ends up using just one of his strategies (we call these agents 'frozen') 

• or Vi — and (yi) stays finite, in which case — 1 < < 1 and the agent keeps 
fiipping between his strategies (we call these agents 'fickle') 

Let us consider the dynamics of preferences for fickle agents. Setting ^^(t) = (yi) -|-ej(r) 
where ej(r) describes small fiuctuations about the average, one can expand (124) to first 
order in ei{t): 

^ii^) = - E ^(1 - "^'Mt) = - E ^^^■^^•(^) (130) 

j fickle j fickle 
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where T^j = — m^). As long as the matrix T = (Tij) is positive definite, the 

above dynamical system will be linearly stable. Now T = UV with Uij ~ and 
Vij = (1 — mf)5ij. But for fickle agents (l^il < 1) all eigenvalues of V are positive 
definite, so that det(T) vanishes together with det(C/). The spectrum of the random 
matrix U can be evaluated using random matrix theory. For our purposes it suffices to 

calculate the minimum eigenvalue, which turns out to be Aq = | ^1 — \J^-^^ ■ The 

instability sets in when Aq = 0, that is when 

l-(t)^a (131) 

This equation and the distinction between fickle and frozen agents only depend on m^, 
which are determined for o; > etc by the unique minimum of if, independently of F. 
Hence Eq. (131) and the location etc of the phase transition, are independent of F. 



4.6. The role of learning rates and decision noise 

It is interesting to consider briefly the impact that the introduction of a finite learning 
rate F has on the properties of the model. Let us begin by noting that F, which at the 
level of agents plays a role similar to an 'inverse temperature', at the collective level acts 
instead as an effective 'temperature', since it tunes the fiuctuating random component 
in agent's dynamics (see (125)). The larger F or, equivalently, the smaller the minimum 
score difference agents can appreciate (this quantity is roughly of order 1/F), the more 
the response fluctuates and the longer it takes to average fluctuations out and reach a 
steady state. 

We have anticipated above that F affects the steady state only in the sub-critical 
phase. Its effect is particularly strong on the volatility, which can be written as 

= + J^eF(l - ml) + J]e^((tanh7/, - mO(tanh|/, - m,)) (132) 

The dependence on F is only present in the last term on the right-hand side, which 
measures fluctuations of tanh yi around its mean. The average is over the distribution 
of (which in turn depends on cr^ via the noise). The latter can be computed from 
the Fokker-Planck equation associated to (124), which itself depends on o"^ (see 125). 
Hence o"^ is determined by the solution of a self-consistent problem [16]. For a > ac, 
fluctuations of t/j are independent and hence the third term of (132) is identically zero. 
As a result, is independent of F, as confirmed to a remarkable degree of accuracy 
by numerical simulations [16]. When a < ac a. correlation arises from the fact that the 
dynamics is constrained to the subspace of y which is spanned by the P vectors and 
which contains the initial condition 2/(0). The dependence on initial conditions and the 
dependence on F both arise as a consequence of this fact. Again, numerical simulations 
fully confirm this picture [16]. 

It is worth remarking that, the smoothed choice rule (123) can also be written as 

Si{t)^sign[yi{t) + Ci{t)/r] (133) 
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where C,i{t) are independent identically distributed random variables with probability 
density p{C,) = ^ [1 ~ (tanh(^)^]. Indeed, for F = the noisy part of the argument of the 
sign dominates and the agent selects his strategy at random with equal probability 
at each time step, while for F — > oo one recovers the original deterministic rule 
Siit) = sign[y i{t)]. 

On the basis of this observation, Coolen et al. [53] introduce a different type of 
decision noise, called 'multiplicative noise', defined as 

Si{t)=sign[y,{t){l + Q{t)/T)] (134) 

which corresponds to 

Prob{si(t) = ±1} = C{t) e^r^'s^lf'Wl C{t) = normalization (135) 

It is evident that in this case frozen agents are affected as well. Indeed, the critical point 
etc turns out to depend rather strongly on F: when F gets smaller the informationally 
efficient phase shrinks as the critical point shifts to smaller values of a. 

4-7. The role of market impact 

Ever since J. Nash's pioneering work in game theory, that of Nash equilibrium (NE) 
has been a reference concept in socio-economic systems of interacting agents. A NE 
is in some sense an optimal state of strategic situations, one in which no agent has 
incentives to deviate from his behavior unilaterally. It is easy to see that, a priori, the 
Minority Game possesses a huge number of such states when N ^ 1. In fact, there 
is one symmetric NE in mixed strategies, where agents draw their bid bi at random at 
every time step with Prob{6i = +1} — 1/2 for all i. This state has a'^ = N and H = 0. 
If N is even, there arc also (^^2) P^^^s strategy NE where half of the players take bi = +1 
and the other half takes bi = —1. Moreover, states where N — 2k agents play mixed 
strategies and the remaining 2k play pure strategies 6j = +1 and bi = —1, are also NE. 
Thus the game possesses an exponentially large number of Nash equilibria. One can then 
ask whether the steady state of the model is one of them. The answer is a resounding 
no. In this section we will study this issue and discuss the important question of why it 
is so. Why are inductive agents playing sub-optimally? We shall see that at the heart 
of the matter lies the consideration which agents have of their market impact, i.e. of 
their impact on the aggregate quantity A{t). In fact, the inability to coordinate on a 
NE follows from the naive idea that in a system of N agents every single agent 'weights' 
1/A^ and is thus negligible in the statistical limit N ^ 00. Once this assumption is 
dropped and agents account for their own impact, the resulting steady state improves 
dramatically and eventually a NE may be reached. 

To begin with, it is instructive to study the role of market impact in the simplest 
MG with P — 1 discussed in Sec. 4.3, in which agents must choose at each time step 
between the two actions e {—1,1}. Let us consider the following modification of the 
learning dynamics (111): 

Ai{t + 1) - Ai{t) = -^AA{t) - rja,{t)]. (136) 
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The term proportional to 7] in (136) describes the fact that agent i accounts for his 
own contribution to A{t). One indeed sees that (136) reduces to (111) for 77 = 0, 
whereas for 77 = 1 agent i considers only the aggregate action of other agents, 
A{t) — ai{t) = "^j^iCLjit), and does not react to his own action aj(t). Values of rj 
between and 1 tune the extent to which agents account for their "market impact" . 

It is easy to see that the dynamics for 77 = 1 behaves in the long run in a radically 
different way than for 77 = 0. Let us take the average of (136) in the steady state and 
define rrii — (0^). We note that 



{Mt + i))-{\{t))^-^ 



rrij — Tjuii 



r OH^ 

'N~dri' 



(137) 




where 



2 

This implies that the stationary values of the ttt-j's are given by the minima of H^. 
Notice that Hi is a harmonic function of the m^'s. Hence it attains its minima on the 
boundary of the hypercube [—1, 1]^. So for 77 = 1 all agents always take the same actions 
ai{t) =171,1 = +1 or ai{t) = rrii = —1 and the waste of resources is as small as possible, 
as = or 1 if is even or odd, which is a tremendous improvement with respect to 
the case rj = (where a'^ ~ or iV^). These states are indeed Nash equilibria of the 
associated N persons minority game. This argument can be extended with some work 
to all 77 > 0, and one can show that the stationary states of the learning process for any 
77 > are Nash equihbria. Hence as soon as agents start to account for their market 
impact (7/ > 0) the collective behavior of the system changes abruptly and inefficiencies 
are drastically reduced. Furthermore, the asymptotic state is not unique (if^ possesses 
more than one minimum!) and the one in which the system settles is selected by the 
initial conditions. The set of equilibria is discrete and the system jumps discontinuously 
from an equilibrium to another, as the initial conditions Aj(0) vary. This also contrasts 
with the 77 = case, where the equilibrium shifts continuously as a function of the initial 
conditions. 

Let us now consider the full MG with market impact correction with public 
information [52] (the above picture is representative of the situation in the MG in the 
limit q; — > 0), whose learning dynamics reads 

U,,{t + 1) - U,,{t) = [A{t) - V (a/:^ - af)\ (139) 

As before, 77 allows to interpolate between the naive 'price-taking' behavior of the 
standard MG in which agents are unaware of their market impact (7/ = 0) and a more 
sophisticated behavior where agents account for it. Note indeed that with 77 = 1 the 
reinforcement Uig{t + 1) — Uig{t) is proportional to the actual payoff that agent i would 
have got had he actually played strategy g at time t. Hence in a way the above learning 
process assumes that agents are able to disentangle their contribution from the aggregate 
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A{t). This may not be reahstic in practical situations. For example imagine that, as in 
the original version of the MG, agents only observe the sign of A{t) and not its value. 
This information is not enough to infer the sign of A{t) — dig^j^-^ + a^^*^ and hence the 
payoff they would have received if they had played strategy g instead of gi{t). However, 
agents can approximately account for the market impact by rewarding the strategy they 
have played by a reinforcement factor rj, i.e. 

Ui,At + 1) - U,,{t) = -af^ + (140) 

In fact, the collective behavior of the learning dynamics above is identical to that 
obtained with (139). This is because what matters in the long run is the time average 

of the processes, which is the same because {aigaig') ^ Sg,g'- 

At first sight, the term proportional to r] looks negligible with respect to A(t) 
because it is of order one whereas A{t) = 0{^/N). However while A{t) fluctuates 
around zero, Ss,si{t) has always the same sign. When the term proportional to A(t) is 
averaged over the P = aN states /i it also becomes of order one. Hence the effect of 
the two terms is comparable in the long run. (A similar phenomenon occurs in spin 
glasses where the naive mean filed theory has to be corrected by the Onsager reaction 
term to eliminate self- interaction effects.). For generic 77 (0 < 77 < 1) the steady state is 
described by the minima of 

H,^H-7jJ2^{l-m^) (141) 

i 

where H — is the predictability. Note that Hi — a'^, so players who fully account 

for their impact effectively minimize fluctuations. 

Unfortunately, the study of the ground state properties of requires techniques 
which are more sophisticated than those used for the MG. Indeed for 77 > the simple 
replica-symmetric solution that we have discussed so far becomes unstable against 
perturbations that break replica permutation symmetry (this is related to the fact 
that has more than one minimum) and one needs to study more complicated 
solution types [54]. The ensuing phase structure is shown in Fig. 18 The critical 
line (analog to the de Almeida-Thouless line of spin- glass theory) can be calculated 
straightforwardly using the dynamical stability argument mentioned at the end of Sec. 
4.5. It suffices to replace Uij — ^i^j with Uij — ^i^j — rjSij^f. The resulting condition 
reads 1 — = a{l — s/riY and coincides with the critical line for replica-symmetry 
breaking. 

The MG behavior [r] = 0) is separated from the Nash equilibrium behavior (77 = 1) 
by a phase transition which is continuous for a > Uc- Remarkably for a < the 
transition occurs at 7/ = and it becomes discontinuous. As shown in Fig. 19, nothing 
dramatic happens when crossing the transition for a > Uc- For a < ac instead a'^ /N 
features a discontinuous jump across the transition line at 77 = 0. The origin of the 
discontinuity lies in the dynamic degeneracy of the system for a < ac and 77 = 0. 
Even an infinitesimal change in 77 can dramatically alter the nature of the minima of 
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Figure 18. Phase diagram of the Minority Game in the (a, 77) plane. The RS 
region corresponds to the replica-symmetric phase and the RSB region to the replica 
symmetry broken phase (from [54]). The mark corresponds to the critical point 
etc — 0.3374. Above it, the RSB RS transition is second order; below it, it is 
discontinuous. 



10' 



10" 



10 ' 



10' 



■ P=64, N=1 01 : minority game 
□ minimization of H 

♦ P= 8, N=1 01 : minority game 
o nninimization of H 



^□□□□□CfcBnnBnnn"°°°°°°°' 



-0.5 



0.5 1 



♦ ♦ 

1.5 



Figure 19. a'^/N as a function of rj for S = 2 and a ~ 0.079 < ccc — 0.3374 and 

a ~ 0.63 > Results both of numerical simulations of the minority game and of the 
numerical minimization of Hr^ are shown. In both cases the replica symmetry breaks 
at r; = (from [52]). 



Hn'. for negative rj there is only one minimum which becomes shallower and shallower 
as 77 — > 0~. At 77 = the minimum is always unique but it is no more point-like. 
Rather it is a connected set. An infinitesimal positive value of 77 is enough to lift 
this degeneracy. The set of minima becomes suddenly disconnected. At fixed a < ac, 
varying 77 across the transition changes continuously - with a discontinuity in its first 
derivative - whereas the remaining fluctuation terms in a'^/N change discontinuously 
with a jump. The potential implications of this result are quite striking: rewarding the 
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Figure 20. Logarithm of the average number of NE divided by A'' as a function of a 
(from [54]). 

strategy played more than those which have not been played by a small amount is always 
advantageous. In particular, an infinitesimal reward is sufficient to reduce fluctuations 
by a finite amount, for a < ac- 

Let us finally come to the case rj — 1, corresponding to NE, in which, as we said, 
steady states coincide with the states of minimum a^. One understands that these 
minima occur when agents play only one of their available strategiesT', since o"^ attains 
minima in the corners of the configuration space [—1, 1]^. The statistical properties of 
the minima of o"^ can again be analyzed with tools of statistical mechanics. As is clear 
from Fig. 18, for rj — 1 one is always in the phase with broken replica symmetry because 
0"^ attains its minima on a disconnected set of points. For S = 2 strategies per agent it 
has been shown analytically via the so-called annealed approximation that the number 
of NE (i.e. of minima of cr^) is exponentially large in N (see Fig. 20). It is clear that the 
global efficiency of NE is better than in the standard MG, since fiuctuations are smaller. 
Furthermore, increasing the number S of strategies the efficiency of NE increases (i.e. 
(j^ decreases) as shown in [52]. This contrasts with what happens in the MG, where the 
efficiency generally decreases when S increases. Therefore, not only agents in the MG 
play sub-optimally, but the more resources they have the larger is the deviation of their 
behavior from an optimum. 

Wc arc still left with the question: why do agents in the MG play sub-optimally? 
In order to answer, let us consider the case of an external agent with S strategics, 

V There may also be other NE, which correspond to saddle points of and are hence stationary 
points of the multi-population replicator dynamics. Agents do not play evolutionarily stable strategies 
in these NE and as we shall see the dynamics of learning never converges to these states. Hence we do 
not consider these NE further. 
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an agent who does not take part in the game but just observes its outcome from 
the outside. From this position, each of his strategies dehvers an average virtual gain 
TT™ = —ttg {A) {g = 1, . . . , S). Given that the strategies a'^ are drawn randomly, the 
7^™^s are independent random variables. Moreover, since it™ is the sum of P ^ 1 
independent variables (A'^) /P, their distribution is Gaussian with zero mean and 
variance 

Var «) = -1 f: Var(a,^) {A\^,f = |. (142) 

Clearly, the strategy g* bearing the highest expected profit Trjf is superior to all others. 
It would be most reasonable for this agent to just stick to this strategy. 

However, the same agent inside the game will typically use not only strategy g* 
since every strategy, when used, delivers a real gain which is reduced with respect to 
the virtual one by the "market impact" . Imagine the "experiment" of injecting the new 
agent in a MG. Then, neglecting the reaction of other agents to the new-comer, one 
would have that {A\ijl) {A\ijl) + dg- Then the real gain of the newcomer is: 

TT^' ~ -^;{A) - {ag ag) = vrf - 1. (143) 

The agent will then update the score of the strategy he uses (say g) with the real gain 
TTg^^^ and those of the strategies he does not use (say g') with the virtual one, so that 
Ug> — TTgJ^^ + 1 — agUgi ~ TT^f^' + L Thcrcfore agents in the MG over-estimate the 
performance of the strategies they do not play. Then if strategy g is played with a 
frequency fg, the virtual score increases on average by 

Vg = Ug{t + 1) - Ugit) = TT^^ -fg+1 (l44) 

The fact that a good strategy g is used frequently reduces its perceived success^ and 
leads agents to mix their best strategy with less performing ones. This is a consequence 
of the fact that agents neglect their impact on the market. It is now clear why, given 
that the market impact reduces the perceived performance Vg of strategies by an amount 
which equals the frequency fg with which strategies are played, agents can improve their 
performance if they reward the strategy which they have played by some extra points 
(the T) factor). This contributes a term rjfg to the rate of growth of strategy g so (144) 
becomes Vg = vr^'^^^ — {1 — ri)fg + 1. Any rj > reduces the market impact and improves 
agent's performance. In particular for rj = 1 agents properly account for the market 
impact and indeed in this case the growth rate Vs of their strategies do not depend on 
the way they play. 

More precisely the frequency fg with which the agent plays strategy g will be such that the rate 
of increase of the scores is the same Vg = v* for all strategies with fg > 0. Strategies which are not 
played {fg = 0) have tt^®^' + 1 < v*. Considering the reaction of other agents does not modify these 
conclusions. 



CONTENTS 



58 



4.-8. Exogenous vs endogenous information 

In the El Parol problem and in the MG the state ^{t) is determined by the outcome of 
past games, as in (32). In other words ii{t) is an endogenous information which encodes 
information on the game itself: agents record which has been the winning action in the 
last m = log2 P games and store this information in the binary representation of the 
integer fi. How do the results which we derived for exogenous information, i.e. when /i 
is just randomly drawn at each time, change if we go back to endogenous information? 

This issue has been the subject of much debate and considerable analytical and 
numerical work was required to settle it. We will limit ourselves here to a sketch of 
the line of reasoning and of the results. As we said, it was at first believed, based on 
computer simulation, that the MGs with exogenous and endogenous information yield 
the same macroscopic pictures. However the situation turned out to be more subtle. 
In fact, (32) implies that the dynamics of ^{t) depends on the collective behavior of 
the game outcome A(t). The key quantity to understand the dynamics of information 
patterns is the stationary state distribution of the process fi{t) which is induced by the 
dynamics of A{t). As in the El Parol model, this process is a diffusion on a De Bruijn 
graph, where the transition probabilities depend on the statistics of A{t) conditional on 
a particular site ii of the graph. When the dynamics of A{t) has a strong stochastic 
component, which occurs when many agents play in a probabilistic fashion (i.e. when 
\mi\ — I (sj) I < 1), all possible transition /i ^ /i' occur with a positive, finite probability. 
Hence the stationary state distribution has a support on all the states // e {1, . . . , P}. 
At odd with the case of exogenous information, some state may be visited more often 
than some other state, but all states are visited. This leads ultimately to the same 
qualitative scenario as in the completely random case and explains the early numerical 
finding on the irrelevance of the origin of the information in the MG*. Roughly speaking, 
one can say that this scenario holds whenever 



i=l 

which in sufficient to ensure that agents behave in a probabilistic way. 

To be more precise, one can analyze the steady-state distribution of history 
frequencies p(/i) relative to the uniform case, which is given by 



(if p{fi) = 1/P for all II, Q{f) is a delta- distribution at / = 1) as was done e.g. in [55]. 
This quantity is reported in Pig. 21. One sees that in the supercritical regime the 
distribution is indeed not uniform. This explains why, from a quantitative viewpoint it 
turns out that macroscopic obscrvables actually depend on the type of information 
in the asymmetric regime a > «c where the deviations of the history frequency 

* Rather than the origin of information, Ref. [48] speaks of irrelevance of memory. The term "memory" 
is used in an improper way. Actually the memory of agents is stored into their scores Uig{t). 




(145) 




(146) 
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Figure 21. Relative distribution of frequencies Q{f) for at a = 0.1 (top) and a = 2 
(bottom). Simulations performed with aN"^ = 30000, with averages over 100 disorder 
samples per point. 



distribution from uniformity are more significant. The arguments just described, though 
approximate, are able to account for these deviations rather well. Recently, the dynamics 
of the MG with endogenous information was solved exactly by the generating functional 
method [56], confirming the general picture outlined above. 

Clearly, the situation changes drastically when one considers the MG corrected for 
the market impact with -q — 1. We know that all agents ultimately freeze in this case, 
so that the learning dynamics converges to a state with Q — 1, i.e. with no stochastic 
fluctuations; therefore in the long run A{f?) becomes a function of ^(t) alone. This 
means that the dynamics of /i(t) becomes deterministic: it locks into periodic orbits 
of the order of \/P values of ji. As a consequence, only a tiny fraction of information 
patterns are generated by the dynamics of A{t) and these few on the periodic orbit are 
visited uniformly (one after the other) . This dynamic reduction of the size information 
space from P to a number of order \fP imphes a similar reduction of the effective value 
of a to something close to 0. Given that a"^ /N decreases with a, we conclude that the 
performance of the system with endogenous information improves with respect to the 
case of exogenous information. For intermediate values of r] and endogenous information 
the system interpolates between the two extreme behaviors of the standard MG {rj = 0) 
- where the origin of information is to some extent irrelevant - and of the sophisticated 
agents {rj = 1) case - where a dynamic selection of a small subset of states occurs. 

5. Extensions and generalizations 

We shall discuss now a few variations on the MG theme, mostly inspired by problems 
related to financial markets, in particular by the origin of the peculiar intermittent and 
non-Gaussian ('fat tailed') fluctuation patterns they generate. In the reference model 
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of price dynamics, which is the simplest one accounting for no-arbitrage hypothesis 
and market's efficiency, the logarithm of prices performs a random walk and hence 
returns are gaussian. On the other hand, several complex agent-based models are able 
to reproduce a realistic phenomenology to a high degree but with little analytic control. 
In the context of MGs we shall see that heavy tails in the distribution of returns and 
clustering in time emerge close to the phase transition, which suggests that markets 
operate close to criticality. Realistic behaviour persists also when agents have a finite 
score memory, but it disappears as soon as agents account for their market impact. We 
shall also briefly discuss MGs with many assets, in which agents have to choose among 
several assets with different information content. Then we shall move on to Majority 
Games and review the properties of mixed models in which fundamentalists and trend- 
followers interact. A discussion of a model with asymmetric (private) information closes 
the section. 

5.1. Grand- canonical Minority Game and stylized facts 

The following model introduces volume fluctuations in the MG, as the number of agents 
involved in the game varies from one time step to the next. In the grand-canonical 
MG [57], each agent i has at his disposal only one quenched random trading strategy 
tti = {of} and has to choose whether to join the market = 1) or not = 0) at 

every time step. In order to make this decision the agent compares the expected profit 
from joining the market to a fixed standard. The model is completely defined by the 
following scheme: 

A{t) = Y,maf^ (147) 

i 

Ui{t + 1) - Ui{t) = -af(*U(i) - €i 

The quantity represents the benchmark: ej < means that agents have an incentive 
to take part in the market because, for instance, they are urged to sell or exchange 
assets; > implies that agents receive a fixed positive payoff by staying away from 
the market, like a fixed interest from a bank. Alternatively, can be seen as the a 
priori incentive of agent i to enter the market: if < (resp. Cj > 0) the agent has 
a small incentive to enter (resp. stay out). One can consider two different types of 
agents: producers, who always enter the market and are characterized by = — oo; and 
speculators, who instead aim at taking profit of fluctuations and are characterized by a 
finite e^. We set 

ei = e hi l<i< Ns 

€i = -oo ior Ns + l <i < Ns + Np = N 

where A^^ and Np stand for the number of speculators and producers, respectively. 
Speculators act on the market only if they expect to receive a payoff higher than the 
benchmark; producers act no matter what. 
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Figure 22. Left panel: relative number of active agents (top), volatility and 
predictability per pattern (bottom) as a function of ris for e = 0.1 (open markers) 
and e = —0.01 (full markers). Right panel: cumuluative probability distribution 
P>{A) = Prob{|A(t)| > x} versus x in the steady state. Inset: time series A{t) 
versus t for = 20 (top) and = 200 (bottom). Prom [57]. 



The relevant control parameters are the relative number of speculators and 

producers, respectively: = Ng/P and Up = Np/P. As usual, one is interested in 
the behavior of the volatility and of the predictability H. Besides, it is interesting 
to analyze also the relative number of active speculators, defined as 

riact = ^ E ('^^) (148) 

i 

Results are shown in Fig. 22. On sees that with a fixed number Up of producers, the 
market becomes more and more unpredictable, i.e. H decreases, as the number Ug 
of speculators increases, independently of the value of e. At the same time also the 
volatility cx^ decreases as agents play in an increasingly coordinated way. In a market 
with few speculators (n^ < 1 in Fig.), most of the fluctuations in A{t) are due to the 
random choice of ii{t) (i.e. cr^ ~ H) and the number ^act of active speculators grows 
approximately linearly with Ug. When Ug increases further, the market reaches a point 
where it is barely predictable. Now the collective behavior becomes e-dependent: 

• for e < the relative number of active speculators continues growing with Ug even 
if the market is unpredictable if ~ 0. The volatility cr^ has a minimum and then 
it increases with Ug 

• for e > 0, instead, the relative number of active traders decreases and finally 
converges to a constant. This means that the market becomes highly selective: 
only a negligible fraction of speculators trade {(f)i{t) = 1) whereas the majority is 
inactive {4>i{t) — 0). The volatihty also remains roughly constant in this limit 

In other words, e = forn^ > n*{np) (n*(l) = 4.15 . . .) is the locus of a first order phase 
transition across which A^act and cr^ exhibit a discontinuity. 
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Figure 23. Kurtosis of A{t) in simulations with e = 0.01, Us = 70, rip = 1 and several 
different system sizes P for F = 1, 10 and oo. 

So far for collective properties; what about stylized facts? Numerical simulations 
reproduce anomalous fluctuations similar to those of real financial markets close to the 
phase transition line. As shown in Fig. 22, the distribution of A{t) is roughly Gaussian 
for small enough (it must tend to a Gaussian when Us — > 0), and has fatter and fatter 
tails as Ug increases. The same behavior is seen for decreasing e: fat tails emerge in the 
vicinity of the critical point. In particular the distribution of A{t) shows a power law 
behavior -P(|A| > x) ~ with an exponent which can be estimated to be /5 ~ 2.8, 1.4 
for Ug = 20, 200 respectively and e = 0.01. With = 100 the exponent takes values 
f3 ~ 1.4,2.3,3.1 for e = 0.01,0.1,0.5. Note that empirical values of (3 typically range 
from 2 to 4. Finally: volatility clustering is observed in conjunction with the power-law 
tails (see inset). 

Let us analyze more closely the emergence of power-law tails in the distribution 
of A{t) and of volatility clustering. In Fig. 23 the kurtosis excess (if x is a generic 

random variable with zero mean, K is defined as = — 3; loosely speaking, it 
is a convenient proxy for the distance of a certain distribution from a Gaussian, for 
which i^' = 0) of the distribution is shown as a function of the system size and of the 
learning rate F for a 'regularized' model with choice rule 

Prob{0i(t) = 1} = 1/[1 + e~^^'(*)] (149) 

One sees that as the system size increases (or if one introduces a small enough learning 
rate F,see below) the distribution tends to a Gaussian as K decreases with P. Moreover 
we see that for a rage of parameters the appearance of fat tails is sample- dependent, as 
both samples with and without fat tails may occur. 

This behaviour is reminiscent of well-known finite-size effects in the theory of critical 
phenomena: in the d-dimensional Ising model, for example, at temperature T = Tc + 'j 
critical fluctuations (e.g. in the magnetization) occur as long as the system size N 
is smaller than the correlation volume ~ 'y~'^'^. But for ^ 'y~<^'^ the system shows 
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the normal fluctuations of a paramagnet. Some light on the finite-size effects in our 
case can be shed by studying the continuous-time limit of the score updating dynamics. 
Regularizing the choice rule to 

prob{0i (t) = 1} = 1/[1 + e-r^* (150) 

with learning rate F, and applying the machinery described in Sec. 4.5, one can 
transform the discrete-time learning dynamics into the continuous-time Langevin 
process 



Ui{t)^-ai{A)-e + r]i{t) (151) 



'y 



{m{t)r]j{^')) = j^a^At-t') (152) 
Notice that the noise strength is proportional to the time dependent volatihty cr^ = (^4^) . 



The noise term is a source of correlated fluctuations because ajCj {A^)/N ~ 1/viV is 
small but non zero, for i ^ j ii N is finite. This noise competes with the deterministic 
part of (151): if the former outweighs the latter, then one expects that the dynamics 
will sustain collective correlated fiuctuations in the Ui{t) which otherwise would be 
washed away. In order to obtain an approximate analytic condition for the onset of 
volatility clustering one may then compare the noise correlation term, which is of order 
ttittj {A^)y/N ~ (T^/P^/^ for i ^ j, with the square of the deterministic term of (151), 



2 




which is given by {A)y + e ~ \/H/ P + e . Rearranging terms, one finds that 
volatility clustering can be expected to set in when 

^ (153) 

P 

where 5 is a constant. This prediction finds remarkable confirmations in numerical 
experiments [57]. Recalling the analogy with magnetic systems made at the beginning 
of this section, one understands that (153) and H/P ~ imply that the same occurs 
in the GCMG with dv — 4. In other words, the critical window shrinks as N~^^^ when 
N ^ oo. However, because of the long range nature of the interaction, anomalous 
fluctuations cither concern the whole system or do not affect it at all. In the critical 
region the Gaussian phase coexists probabilistically with a phase characterized by 
anomalous fluctuations. This, like the discontinuous nature of the transition at e = 0, 
is typical of flrst order phase transitions. 



5.2. Market ecology 

One of the flrst modiflcation of the MG has investigated the effects of introducing an 
explicit asymmetry in the two possible actions [58]. This is the case of the El Farol bar 
problem: the actions 'go' or 'don't go' to the bar are not symmetric because (i) if one 
takes the wrong action there is still a difference between going to a crowded bar and not 
going to an uncrowded bar and (ii) the comfort level corresponds to a share of 60% of 
agents attending. If each agent takes the opposite choice one ends up in an inefficient 
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attendance of 40%. The outcomes of the MG are instead symmetric: If every agent 
switches to the opposite choice, all the payoffs remain unchanged. Quite generally this 
leads to study games where the payoffs to agent i at time t is given by 



where Aq — {A^} is some fixed vector. In particular, [58] investigated the case where 
Aq — L independently of as in the El Parol bar, and where information is endogenous. 
Interestingly, because of the fact that due to (32) some values of ji occur more often 
than others, the conclusion that the collective behavior is independent of whether the 
information ^ is endogenously generated or is exogenous (i.e. random), which was 
roughly correct for the standard MG, is not true in this case. 

There is however a second motivation for considering a model based on (154) which 
was explored in [59,60]. Gonsidering the MG as a model of a financial market, it can be 
argued that there are different types of market participants with different goals. Some 
trade to gain money from transactions with no particular interest in the asset they buy 
and sell. Only price fluctuations matter for this kind of traders, which one usually calls 
'speculators'. Another type of market participants are those who use the market for 
exchanging goods. This is indeed the reason why markets exist. This type of agents 
is interested in the asset itself: they will buy it or sell it irrespective of the history of 
recent fluctuations: this type of agents can be called producers. While speculators have 
a range of behavioral rules which process the available information in search of arbitrage 
opportunities, producers use a trading rule which is constant in time. Producers are 
part of the financial world and their behavior is correlated with the state of the world ^ 
which is thought to capture all relevant economic information: in other words, they only 
have one strategy at their disposal. This type of traders play a role similar to that of 
hedgers^: they inject information into the market. Their trading activity is completely 
predictable given the state of the world /i and the term Aq represents their aggregate 
contribution to the market. 

It is easy to understand that in a market composed of producers only the 
distribution of price changes would be nearly Gaussian: in fact, Aq can be regarded 
as the sum of Np random terms, where Np is the number of producers. The process 
associated to producers can be considered as the fundamentals, i.e. the price process 
which refiects the economic performance of the asset. Roughly speaking, one may expect 
that speculative trading will color this process and transform its statistical properties. 
Actually the discussion may be extended to a ftirther type of agents, the so-called noise 
traders. These persons totally disregard the state of the world fi or have no information 
at all on it. They rather follow rules of behavior which arc statistically uncorrclatcd 
with fi (such as the moon phases) and with the behavior of other agents. The presence 
of these agents does not introduce any new qualitative features. The question is: how 
do all these "species" of traders interact? 

ti A hedge is an action (e.g. buy/sell) done with the aim of reducing the risk of another action. 




(154) 
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Figure 24. Average gains of producers and speculators as a funcion of tlie (reduced) 
number N/P of adaptive agents (speculators). The plot refers to a system with Np — P 
passive agents (producers). The gain of speculators is positive only when they are few 
and it decreases when new speculates join the market. Producers losses are reduced by 
speculators. The predicatbility H is also plotted. Inset: Phase diagram in the space 
of the reduced numbers of speculators and producers. The shaded region to the right 
of the solid line is the symmetric phase where H = 0. The gain of speculators vanishes 
on dashed line and it is positive in the region to the left. 



An intuitive argument runs more or less as follows. First, note that in a market 
composed of producers price changes would depend only on fi. Such a highly predictable 
market is very favorable for speculators who may derive considerable gains. However 
when more and more speculators join the market, its predictability decreases and the 
profit of speculators gets more and more meager. This effect is illustrated in Fig. 24, 
which also shows that producers instead benefit from the presence of speculators because 
their losses are reduced. When the number of speculators increases beyond a critical 
value, which depends on the relative number Np/ P of producers, the market enters the 
symmetric phase where H = and the outcome A{t) becomes unpredictable from /i. 
This shows that the relation between these two species is more similar to symbiosis than 
to competition: producers feed speculators by injecting information in the market and 
benefit, in their turn, of the liquidity provided by speculators. 

5.3. Multi-asset Minority Games 

5.3.1. Definitions and results Minority Games with many assets have been introduced 
in order to investigate how speculative trading affects the different assets in a market 
[61,62]. A tractable version of these models has been considered in [63], with the aim 
of studying how agents modify the composition of their portfolios depending on the 
'complexities' or information contents of the different assets. 

The model consists essentially of two coupled MGs with one strategy each. Let us 
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consider the case of a market with two assets 7 e { — 1, 1} and agents. At each time 
step I, agents receive two information patterns ix^ G {1, .... P^,}, chosen at random and 
independently with uniform probabihty. As always, Py is taken to scale linearly with iV, 
and their ratio is denoted by = P^/N. Every agent i disposes of one trading strategy 
per asset, a^^ = {0^!^}, that prescribe an action af^ e 1} (buy/sell) for each 

possible information pattern of asset 7. Each component af^ is selected randomly and 
independently with uniform probability and is kept fixed throughout the game. Traders 
keep tracks of their performance in the different markets through a score function Ui^{£). 
The behavior of agents is summarized by the following rules: 



Si(t) = sign[7/j(t)] 

N 

Mt)-T.^^fSs,^t),, (155) 

Ui^{t + 1) - Ui^{t) = -a';f^A^{t)/^/N 

where A^{t) represents the 'excess demand' or the total bid of asset 7, while ?/,;(t) = 
^^7f/j^(t). The Ising variable Sj indicates the asset in which player i invests at time t, 
which is simply the one with the largest cumulated score. As usual, it is the minus sign 
on the right-hand side of (155) that enforces the minority-wins rule in both markets. 
It is possible to characterize the asymptotic behaviour of the multi-agent system (155) 
with a few macroscopic observables. In the present case, besides traditional observables 
such as the predictability H and the volatihty u^, defined respectively as 

1 

7e{-i,i} ^ M7=i 

7 

it is important to analyze the relative propensity of traders to invest in a given market, 
namely 

1 ^ 

^ = TV E (^^) (158) 

i=l 

A positive (resp. negative) m indicates that agents invest preferentially in asset +1 
(resp. —1). 

The phase structure of the model is displayed in Fig. 25. The (q;+,q;_) plane is 
divided in two regions separated by a critical line. In the ergodic regime, the system 
produces exploitable information, i.e. H > 0, and the dynamics is ergodic, that is 
the steady state turns out to be independent of the initialization Ui^{0) of (155). 
Below the critical line, instead, different initial conditions lead to steady states with 
different macroscopic properties (e.g. different volatility), but traders manage to wash 
out the information and the system is unpredictable {H = 0). This scenario essentially 
reproduces the standard MG phase transition picture. 
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Figure 25. Left panel: analytical phase diagram of the canonical two-asset Minority 
Game in the (Q;+,a_) plane. Right panel: behavior of m (top). H (middle) and cr^ 
(bottom) versus a+ — a- for a+ + a_ = 0.5. Markers correspond to simulations with 
N = 256 agents, averaged over 200 disorder samples per point. Lines are analytical 
results (from [63]). 

The behaviour of the macroscopic observables m, H and cr^ along the cut q;++q;_ = 
1/2 (in the ergodic phase) is also reported in Fig. 25. One sees that agents play 
preferentially in the market with smaller information complexity, which is particularly 
inconvenient as it coincides with the one with less exploitable information. This is 
a somewhat paradoxical result since a nai'vc argument would suggest that agents are 
attracted by information rich markets. It actually turns out that this simple argument 
is incorrect and the observed behavior is due to the fact that agents are constrained to 
trade in one of the two markets. Rather than seeking the most profitable asset, agents 
simply escape the asset where their loss is largest. The conclusion is indeed reversed 
when traders may stay out of the market and have negative incentives to trade (that 
is, when they have an incentive not to trade). In this case, which corresponds to a 
grand-canonical multi-asset MG, the information-rich asset is chosen preferentially [63], 
though the phase structure becomes more complex than usual as new phases (with 
broken ergodicity and global predictability) arise. Note however that in this framework 
no correlations among the assets emerge, i.e. {A^A^^) = 0. Indeed 

Now, the dynamical variables Ui^{t) evolve on timescales much longer (of order P^) 
than those over which the /i^ evolve. Hence we can safely assume that the distribution 
of Si is independent of /^^ and factorize the average (a^^a^l) = (a^^)(ajl) over the 
independent information arrival processes iJ>±{t). Given that {af^) — the conclusion 
{Aj^A_) ~ follows immediately. The reason for this is that traders' behavior is aimed 
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at detecting excess returns in the market with no consideration about the correlation 
among assets. This conclusion is against the empirical evidence, as in real financial 
markets correlation between stocks are overwhelmingly positive (if it wasn't so, making 
money in a financial market would be much easier!). The microscopic origin of this 
phenomenon is a rather difficult issue, which will surely receive much attention in the 
near future. 

Below we describe the dynamical solution of this model, as an example of the 
application of the path-integral formalism to this type of problems. 

5.3.2. Dynamics (path-integral approach) The dynamical approach to the stationary 
macroscopic properties of Minority Games is based on the use of dynamical generating 
functionals a la Martin-Siggia-Rose [64] to turn the original multi-agent process into 
a single stochastic equation for the behavior of a single 'effective agent', similarly to 
what is done to study the dynamics of spin systems with quenched disorder after [65]. 
This procedure, which was first applied to Minority Games in [66], allows ultimately 
to derive closed equations for correlation functions, response functions, and all other 
relevant time-dependent macroscopic parameters. Typically, the resulting equations are 
too complicated to be solved at all times. However, with suitable Ansatze one may 
restrict the analysis to specific solvable regimes (in this case, we shall focus on ergodic 
steady states). Dynamical phase transitions can then be identified from the breakdown 
of the assumed behavior. The method is very general, it doesn't rely on the existence of 
a Hamiltonian nor on the validity of detailed balance, but requires an analytical tour de 
force for solving the most general MGs. Luckily, some reasonable starting simplification 
help to make it less cumbersome. One is Markovianness, which in MGs corresponds to 
models with random external information. Another is changing the updating rule from 
the usual 'on-line' learning, in which agents modify their preferences at each time step, 
to a 'batch' learning, in which agents update their preferences only after they have seen 
all possible information patterns Strictly speaking, the batch process is not equivalent 
to the on-line process but in many cases, including that which we consider here, the 
two are qualitatively identical. Both simplifications will be made in this section, where 
we expound the dynamical solution of the canonical multi-asset MG. The method is 
described in detail for other models and more general cases in [43]. 

So we consider two coupled GCMGs, interpreted as a system with two assets 
characterized by different sizes of information sets and, on the agents' side, by different 
strategies and valuation functions. Prom (155), one sees that the preferences evolve 
according to 

y,(t + l)-y,{t)^- l<fA,{t)/^ (160) 

7e{-i,i} 

The 'batch' approximation is obtained by averaging the right-hand side over the //^'s. 
This leads, after a time re-scaling (for simplicity, we denote the re-scaled time again by 
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N 

7e{-i,i} i=i 

where = l/a^ and J^^ = {)-/N)Y^^_^a^^a^^ are quenched random couphngs of 
Hebbian type. We also introduced the variable 

Mt)-lK{t),^-\b + Si{t)] (162) 

All moments like ■mi{t) — {si{t)) and Cij{t,t') — {si{t)sj{t')) - the brackets standing for 
an average over all possible time evolutions of the system - and in turn macroscopic 
quantities like the magnetization m = autocorrelation function 

C{t,t') — {{'^iCii{t,t')/N)) can be derived formally from the generating functional 

Z[-0] = (e^^*^W-«(*)> (163) 

by taking suitable derivatives with respect to the auxiliary generating fields i/? = {V'i}; 
for instance 

The (• • •) average is performed by imposing that the Sj satisfy (161) at each time step: 



C%t') = -±y:Ura^j§i% (164, 



{{Zm) = j p[y(0)] e^2*^W-W (l^W[y{t) ^ yit+l)]j^ dy{t) (165) 



W[y{t)^y{t + l)]=Y[5 



with transition matrix fixed by (161): 

N 

y^{t + l)-y,{t)-h,{t)+ ^iT.'^ij'i'nit) (166) 

7e{-i,i} j=i 

The fields hi{t) will be used to generate response functions. At this point the following 
steps need to be taken: 

a. Introduce the order parameters 

1 ^ 

1=1 
1 ^ 

i=l 
1 ^ 

KM = -J2s^{tMt') (167) 



N 
1=1 



1=1 
1 ^ 
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in (165) via such identities as 



1 



= j dQ{t,t')S 



N 



NQ{t,t')-J2si{t)si{t') 



i=l 
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(168) 



b. Use the integral representation for the (5-distributions; 

c. Average over the quenched disorder after isolating the relevant terms with the help 
of the variables 



17 



W. 



(169) 
(170) 



These steps require standard manipulations at most. After a factorization over i and 
/x-y, one arrives at 



(171) 



where Q{t,t') = {Q{t,t'), L{t,t'), K{t,t'), a{t), k{t)} is the vector of order parameters, 
Q{t,t') = {Q{t,t'), L{t,t'), K{t,t'),a{t),k{t)} is the conjugate vector of Lagrange 
multipliers, while the functions ^, $ and fl are given by 



* = i^ ^a(t)a(t) + e(t)£{t)- 
t 

+i J2 [<9(^' ^')Q(^, t') + Ht, t')Ht, t') + K{t, t')K{t, t') 

t,t' 

i ^ t 

7 

+q;^ log j dwe~^ L{t,t')w^it)w^(t')-^ Et.t'KK-D^)"' A](«,*')'27(«)'27(*') | 



(172) 



where 



D^{t, t') = ^ [1 + ia{t) + ja{t') + Q{t, t') 

177 

A,{t,t')^5u'-^bm+K{t,t')\ 



(173) 

(174) 

(175) 
(176) 



In the limit N ^ oo the integral (171) is dominated by the saddle-point where the 
order parameters take the values 

C{t,t') = {s{t)s{t'))^ L{tX) = {y{mt')). 

K{t,t') = {smt')i <t) = {s{t)i 



C{t,t')^i 



dC{t,t') 



(177) 
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aL(i,i') ' ' ' dK{t,t') 

a(f) = 1—— k{t) 



where 



da{t) ' ' dk{t) 

, . _ 1 ^ I---M{{ym,{ym)Utdy{t)dy{t) 

^ h j M{{y{t)},{ymi\tdyit)dy{t) ^ ^ 

denotes an average performed with the measure 

xe-'St.*'P(*'*')^W^(^')+^(*'*')^'W^'(*')+^(*'*')*W^'(*')] (179) 

Now comparing the above averages with the derivatives of ((Z)) with respect to 
il) and h one easily sees that, in the hmit A'" — > oo, Q{t,t') may be identified with the 
autocorrelation function C(t, t'), a{t) turns out to coincide with the magnetization m(i), 
whereas K{t, t') may be related to the response function 

G(M')= l..nly:«)» (180) 

^ ' ' N^oo N ^ dhi{t') ^ ' 

through K{t^ f) — iG{t, t'). Working out the remaining equations, and in particular the 
expression of one finds in addition that 



L = k = C = a^0 

7 7 
£=-^E[^7~'K^7)A-'] 



7 



Therefore M can be seen as describing the single-agent process with noise z{t) given by 
y{t + 1) - yit) = - J] [l + it, t')<P,{t') + zit) (182) 



7,t' 



{z{t)z{t')) = E (i + Y^) ' K^^) (i + ^g) ■ 



7 



{t,t') (183) 



which is completely equivalent to the original multi-agent system in the limit N ^ oo. 

Let us now focus on the asymptotic properties of the stationary state, considering 
the simplest possibility. Making for the asymptotic behavior of C and G the 
assumptions of time-translation invariance, 

lim C{t + r,t)^C{r) (184) 

t— >oo 

lim ^(t + r,^) = G(r) (185) 

t— >oo 

finite susceptibility, 

lim VG(i,i') < oo (186) 

t— >oo ' 

t'<t 
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and weak long-term memory, 

limG(i,i') = Vt' finite (187) 

ergodic stationary states of tfie dynamics can be fuiiy cfiaracterized in terms of a few 
parameters. These are, in particular, the persistent autocorrelation 

c= lim - VC(t) (188) 

t<T 

the magnetization 

m= lim - Vm(i') (189) 

t^oo t 

t' 

and the susceptibility (or integrated response) 

X=lim^G'(i) (190) 



T-^OO ■ 

t<T 



In this regime, the quantities 

y=\im^ s^\im-Ys{t') z = lim -Yz{t') (191) 

i— >oo t t— >oo i ' >oo t ' 

t' t' 

are easily seen to be related by 

s + 7 



where 



= (193) 

We have the following scenarios: 

(i) if y > 0, then s = 1 (the agent is frozen on asset 1): this occurs ii z > k,+ 

(ii) if y < 0, then s = — 1 (the agent is frozen on asset —1): this occurs if z < — 

(iii) if y = 0, then s — s* = '^^ ^''J'^'' (the agent is fickle): this occurs if —K- < z < k+ 

Separating the contribuctions of different cases we end up with the following equations 
for m, c and x' 

m = {9iz - K^))^ + {s^B{z + K.)e{K+ - z))^ - {e{-K. - z))^ 

C = {0iz - K+))^ + {{S^fe{z + K.)e{K+ - Z))^ + {e{-K. - Z))^ (195) 

where (• • ■) ^ is an average over the static Gaussian noise z. The Gaussian integrals 
can be easily computed and these equations can be solved numerically for c, m and x- 
Notice that n+ > n_ (or q;+ < a J) implies Kj^ < so that the probability that an 
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agents 'freezes' on asset 7 is larger for 7 = +1, i.e. for the asset with less information. 
This conclusion is immediately clear from the above equations. A little more work is 
required to see that H is given (apart from factors a-y) by the persistent part of the 
noise variance (183): 

.?(l+2„n + c) 

These expressions finally yield the analytical curves shown in Fig. 25. 
5.4.- The Majority Game 

The simplest way to get a glimpse on the macroscopic properties of the Majority Game 
is to consider the simplified information-free context of Sec. 4.3, where the model is 
described by the rules 

Prob{6i (i) = 6} = C exp [fcA^ (i)] (197) 
A,(i + l)-A,(i) = rA(0/iV (198) 

by which agents reward the action taken by the majority and increase the probability 
of choosing hi{t + l) — sign [^(i)]. An analysis similar to that outhned in the case of the 
Minority Game easily leads to the conclusion that the dynamics of y{t) — Aj(i) — Aj(0) 
(which is i-independent) admits the solution y{t) — yo + vt where v — ±r. In this state, 
agents behave coherently (bi(t) = b for all i). Consequently, (A) is either N or —N and 
(7^ — 0{N^) independently of F. 

The above conclusion that Majority Games generate huge fluctuations is rather 
intuitive. However the full Majority Game turns out to be a surprisingly rich model [67]. 
It is defined by the following setup: 

gi{t) = arg max[/jg(i) 

i 

U.,{t + 1) - U,,{t) = aff [A{t) - 7J - <^)^ 

where /i(t) G {1, .... P} stands for the information pattern presented to agents at time 
t (taken to be external and random) and rj tunes the agents' ability to learn to respond 
to the action of all other agents by disentangling their own contribution to the game's 
outcome. 

Using the notation introduced in Sec. 4.5, it is easy to see that 

Vi = {yi{t + l)-yi{t))=^+Y^ Wj^j - V^m (200) 

i 

where rrii = (sign(|/j)). Hence the dynamics minimizes the function 

^j^i^j - E + 1 E (201) 

i,j i i 
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Adding the constant —W-jl to complete a square with the first to terms above, one 
sees that is a downward concave function of the mj's, which imphes that minima 
occur on the corners of the definition domain [—1,1]^. Thus the solution with f j = 
corresponding to fickle agents is ruled out in this case and the only remaining solutions 
are those with 7^ (and yi{t)/t finite as t — > 00), corresponding to frozen agents. For 
these, 

rrii = s\gn{vi) = sign ^iQ. + ^ iiijrrij - -nQm^j (202) 

Notice that since the relevant steady states have = ±1 the last term in plays the 
role of a mere constant. Hence impact factors do not alter the steady state properties of 
the Majority Game. (Also due to agents' freezing, the 'batch' and 'on-line' version yield 
the same stationary properties as fluctuations play no role in this case.) Furthermore, 
it is clear that any configuration {rrii} which is a solution of these equations for some 
value of 77 e [0, 1] will also be a solution for all rj' < rj. Hence the set of stationary 
states is such that C Sj,' for rj' < r) and, in particular, <Si C ^S^ for all 77 < 1. It is 
also easy to see that the state with minimal value of if^ lies in Si for all 77 e [0, 1]. This 
shows that Nash equilibria arc stationary states of the majority game for all values of 
?7, but the converse is not true (except for 77 = 1 of course). 

It is possible to draw a complete picture of the model's behavior by studying the 
minima of H^^ explicitly via the replica method. The calculation has been carried out 
in [67] under the assumption that the two strategies of the same agent can be to some 
degree correlated, which is allowed if one takes the disorder distribution 
w 1 — w 

P{ai, 02) = — (5ai,l5a2,l + ^ai -1^02 -1) H ^ {^aul^ai-l + (^ai -1^02,1) (203) 

Notice that w — Prob{af]^ = 0^2}- It turns out that, depending on the parameters, the 
system can be in one of two phases: a 'retrieval' phase characterised by attractors with a 
macroscopic overlap = 0{N) with a given pattern (say, = 1) and a spin glass phase 
with no retrieval (A^ = 0{\^)). The occurrence of 'retrieval' may be thought of as the 
emergence of crowd effects such as fashions and trends, when a large fraction of agents 
behave similarly in some respect, or to economic concentration, when, for example, one 
particular place is arbitrarily selected for large scale investments. Interestingly, one 
finds that the development of these crowd effects requires: (i) that the number of agents 
is large compared to the number of resources {a small), (ii) a sufficient differentiation 
between strategies of agents {w < 2/3) and (iii) a large enough initial bias (i.e. an 
initial macroscopic overlap) towards a particular resource, fashion or place. Finally 
crowd effects can be sustained under more general conditions (i.e. in the spin glass 
phase) if agents do not behave strategically, i.e. if they neglect their impact on the 
aggregate (77 small). This phenomenon can be attributed to the self-reinforcing term 
{l — ri)^fsi in the dynamics which causes a dramatic increase in the number of stationary 
states as 77 decreases (which can be seen quantitatively by analyzing the entropy). 
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5.5. Models with interacting trend-followers and contrarians 

It is rather easy to understand that the two main groups of traders, that is 
fundamentahsts and trend-followers, contribute opposite forces to the price dynamics. 
Fundamentalists believe that the market is close to a stationary state and buy (sell) when 
they repute the stock to be underpriced (overpriced), thus inducing anti-correlation in 
market returns and holding the price close to its 'fundamental' value. Trend-followers, 
instead, extrapolate trends from recent price increments and buy or sell assuming that 
the next increment will occur in the direction of the trend, thus creating positive return 
correlations and large price drifts ('bubbles'). Chartist behavior, which can also be 
driven by imitation, is known to cause market instabihty. Fundamentalists act instead 
as a restoring force that dumps market inefficiencies and excess volatility. The next 
question we address concerns the macroscopic properties of models in which contrarians 
and trend- followers interact. 

As usual, wc start from the simple model with no information. Let us assume 
that a fraction / of agents arc trend followers whereas the remaining (1 — f)N are 
fundamentalists. The dynamics is governed by the following scheme: 

Prob{6i(t) = h} = C exp [feA^ (t)] (204) 

A,(i + 1) - A,(t) = eiTA{t)/N (205) 

where = 1 for trend-followers (say for i G {!,..., /A^}) and = —1 for 
fundamentalists (say i G {fN -|- 1, . . . , A^}). Assuming that Aj(0) = for simplicity, 
we can approximate A{t)lN with its average and see that the dynamics of y{t) = 
Ai{t) - Ai{0) = A,(t) is given by 

y{t + 1) - y{t) = (2/ - l)r tanh[|/(t)] (206) 

Linear stability analysis of (206) leads to the following scenario. For / < 1/2 we have 
two regimes: 

• for r < the fixed point y* = is stable. One has {A) = and cr^ = 0{N) as 
in the information-free Minority Game with subcritical F 

• for F > the fixed point y* = is unstable. One has (A) = and = 0{N^) 
as in the information-free Minority Game with supercritical F 

For / > 1/2 instead the fixed point y* = Ois unstable and the solution y{t) = yo+vt with 
V — ±(2/ — 1)F appears. Here, both trend-followers and contrarians behave coherently: 
bi{t) = b for alH G {1, . . . , fN} and bi{t) = -b for all i G {fN + 1,...,N}. As a result, 
(A) is either (2f — l)N or (2f — l)N and cr^ = 0(N'^) as in the information-free Majority 
Game. The conclusion we draw is that the expectations of the majority group (be it 
fundamentalists or trend-followers) are fulfilled in the steady state. This is confirmed 
by studying the autocorrelation of returns as a function of / in the steady state, see 
Fig. 26. 

This conclusion extends to the full model, whose properties have been analyzed 
in [68]. The mixed Majority- Minority Game is defined by 

gi{t) = arg max Uig{t) 
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Figure 26. (from Ref. [44]) Autocorrelation of returns as a function of the fraction / 
of fundamentalists in the market. Autocorrelation is taken in the stationary state of 
a system of TV = 10^ agents with T = 2.5. Arrows mark the transitions between the 
three regimes described in the text, which occur at / = 0.5 and at / = 0.9. The inset 
shows a detail of the central part of the graph. 



(207) 



U,,{t+l)-U,,{t)^e,ai:^'^A{t)/N 

where as before = 1 for trend-followers (or i G {1,...,/A^}) and = —1 for 
fundamentalists (or i e {fN -\- 1,...,A^}). The statistical mechanics of this model 
is slightly more involved than previous cases. As before, one finds that the steady 
state can be characterized in terms of the microscopic variables uii = (sign(yj)) where 
Uii^) — ~ Ui2{t)]. In particular, the stationary m^'s for can be obtained by 

solving the following problem: 



max min H{mi,m2) 

m2 mi 



where 



H{mi,m2) 



1 



mi 



(208) 



(209) 



and mi (resp. 7712) denote collectively the variables of Minority (resp. Majority) 
game players. Hence the mixed game where both minority and majority players 
are present at the same time requires a minimization of the predictability in certain 
directions (the minority ones) and a maximization in others (the majority ones). It is 
possible to tackle this type of problem by a replica theory [69] . The idea is to introduce 
two 'inverse temperatures' (5i and (52 for minority and majority players respectively, 
such that [68] 



max min H(mi,m2) — lim 

m2 mi ^i,/32— >oo 



-{{l0gZ{P„P2))) 

P2 



(210) 
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with the following generalized partition function: 



dm2 e'' 



ft log/ dmi e-ft^ 



dmi e"^!^ 



(211) 



(213) 



where 7 = /32//3i > 0. In physical jargon, this describes a system where: first, the mi 
variables are thermalized at a positive temperature l//5i with Hamiltonian H at fixed 
1712] then, the ra2 variables are thermalized at a negative temperature — 1//?2 with an 
effective Hamiltonian i/cfr defined by —l3iHcs{m2) = log J drrii e~^^^ . The disorder 
average can be carried out with the help of a 'nested' replica trick. First, one replicates 
the minority variables by treating the exponent —7 as a positive integer R (in the end, 
the hmit R —7 < must be taken). (211) thus becomes 

I dm2 I dmi e-^i^ = j dm2 j e"^! S^^«"^J>'"^) \[ dm\ (212) 

J iJ J ^ r=l,R 

Then a second replication is needed, this time on the m2 variables: 
Z^' ^ I e-/^ii:„,.w({mr},{m§}) -Q -Q dmrdml 

a=l,R' r=l,R 

At this point we have two replica indexes with different roles: the replicas labeled a 
have been introduced to deal with the disorder, and their number R' will eventually go 
to zero, as usual; the replicas labeled r have been introduced to deal with the negative 
temperature, and their number R must be set to a negative value. Majority variables 
bear just one index, while minority ones have two. We can interpret this fact by saying 
that rUg indicates a particular configuration of the majority variables, i.e. a given 
manifold in the whole m space; and indicates the minority coordinates in that 
particular manifold. Notice that the min and max operations and hence the meaning 
of coordinates in the above interpretation can be interchanged. In general, this leads to 
different solutions. In our case, however, one can verify that the main results would not 
change, though the intermediate steps (e.g. the definition of 7) would vary. 

Following the procedure outlined above it is possible to calculate the phase diagram 
of the model (Fig. 27), namely the line of critical points adf) for different values of 
/ separating the asymmetric, information-rich phase {a > adf)) from the symmetric, 
unpredictable regime (a < adf)). One sees that the efficient regime shrinks as the 
fraction of trend-followers increases until, for / = 1/2 it disappears. Now trend- followers 
are the majority group and the market becomes completely predictable. The dynamical 
calculation clarifies the phase transition further by relating the critical line to the onset 
of ergodicity breaking. 

While this model captures one of the basic effects of the presence of trend- followers 
in the market, namely a decrease in efficiency, it is clear that the properties of mixed 
games are to some extent a linear combination of those of pure games and thus a 
gross simplification with respect to a realistic case. Now it is reasonable to think that 
real traders may revise their expectations if they prove wrong or simply may want to 
weigh their decisions against other factors than the expected profit. For instance, in 
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certain market regimes (e.g. bubbles) a trader could perceive the market as a Majority 
rather than Minority Game and consequently switch from a fundamentalist to a trend- 
following behavior. Similarly, in situations of high volatility traders would likely take 
into account the risk factor when choosing a trading strategy over another. How would 
the macroscopic properties of the Minority Game change if agents were allowed to modify 
their behavior and expectations according to the market conditions they perceive? 

This issue may be tackled through the introduction of a more general MG setting 
with the rationale that traders prefer to adopt a trend-following attitude, and thus 
perceive the market as a Majority Game, when fluctuations are small while they revert 
to fundamentals, and hence perceive the market as a Minority Game, when the price 
dynamics becomes more chaotic [70,71]. This mechanism leads to a surprisingly rich 
phenomenology which includes the formation and disruption of trends and the emergence 
of 'heavy tails' in the returns distribution. The model is defined through 

gi{t) = arg maxC/jg(t) 

^W = E<:w (214) 

i 

Ui,{t-rl)-Ui,{t)^4'^Fi[A{t)\ 

where the function Fj embodies the way in which agent i perceives the performance of 
his/her g-ih. trading strategy in the market. For simplicity we shall henceforth assume 
that Fi = F for all i. Clearly, F(A) = —A for a Minority Game whereas F(A) — A for 
a Majority Game. The case we consider is 

F{A) ^A-eA^ (215) 

with e > 0. For e = one has a pure Majority Game. Upon increasing e, the non-linear 
gains importance, and for e — > oo one obtains a Minority Game with F{A) oc —A^. A 
couple of remarks are in order. 
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Figure 28. Normalized return autocorrelation function as a function oi a ~ P/N 
for different values of e (left) and probability distributions P{A) of ^ > for different 
values of e for a = 0.05 (top right) and a — 2 (bottom right). From [70]. 



(i) This mechanism is expected to induce a feed-back in the dynamics of the excess 
demand: when it is small, trend-followers dominate and drive it to larger values 
until fundamentalists eventually take over and drive it back to smaller values. 

(ii) It is reasonable to think that e should fluctuate in time and possibly be coupled 
to the system's performance. A possible microscopic mechanism is the following. 
When e is large a high volatility is to be expected as agents are more likely to behave 
as trend-followers. As a consequence, they should likely reduce their threshold since 
the market is risky; however, for small e fundamentalists are expected to dominate 
and the game should acquire a Minority character. Hence the predictability will be 
smaller and there will be less profit opportunities. Agents may then decide to adopt 
a larger threshold to seek for convenient speculations on a wider scale. If these two 
competing effects are appropriately described by an evolution equation for e, the 
system should self-organize around an 'optimal' value of the parameter. However 
such a time evolution should take place on time-scales much longer than those 
which the model addresses (intra-day/daily trading) and hence it is reasonable to 
study the case of fixed e. 

It turns out (see Fig. 28) that while for low enough (resp. high enough) e the behavior 
of a pure Majority (resp. Minority) game is recovered (with some qualitative differences 
due to the unconventional nature of the MG in this case), there exists a range of values 
of e for which the two tendencies coexist and one can cross over from one to the other 
by changing a and/or e. This can be seen from the behavior of the (normalized) 
autocorrelation D = {A{t)A{t + 1)) /a^ as a function of a. The crossover gets sharper 
and sharper as a increases and turns into a sharp threshold for a ^ 1. In this case, the 
threshold can be estimated analytically. Indeed one has 

V, ^ {y,{t + 1) - y,{t)) = WWiW) (216) 

As usual, if Vi ^ 0, then ~ Vit and Si(t) tends asymptotically to sign(t>j): there is a 
well defined preference towards one of the two strategies and the agent becomes frozen. 
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For large a, we can approximate A{t) with a Gaussian random variable with variance 
H. By virtue of Wick's theorem, this imphes that {A^\lj) — 3i? {A\ijl), so 

Vi^{l- 3eH)Ct{A\f,) (217) 

If 1 — SeH > 0, the agents' spins will freeze on the Majority-type solution Si — 
sign(^f which is unstable for 1 — SeH < 0. Given that if = 1 for large a, 

we see that the crossover from the Majority- to the Minority-regime takes place at 
e ~ 1/3 for q; ^ 1, which is significantly close to the numerical value of Sc — 0.37. 

For small a, when the contribution of frozen agents is small, we expect the system 
to self-organize around a value of A such that F{A) = 0: indeed one can sec from Fig. 
28 that the peak of the distribution moves as Besides, as e increases, large excess 

demands occur with a finite probability. The emergence of such 'tails' in P{A), while not 
power-law, is a clear non-Gaussian signature. The dynamics in this regime is particularly 
interesting: while the market is mostly chaotic and dominated by contrarians, 'ordered' 
periods can arise where the excess demand is small and trends are formed, signaling 
that chartists have taken over the market. These trends, that can be arbitrarily long, 
eventually eventually die out restoring the fundamentalist regime. 

In order to understand the full impact of trend-followers it is however necessary to 
emply endogenous information [71]. Indeed, one identifies two regimes in an intermittent 
market dynamics. Phases with small fiuctuations, dominated by contrarians and in 
which the information dynamics is roughly ergodic over the possible patterns, are 
followed by phases with large fluctuations dominated by trend-followers, where the 
information dynamics is strongly non-ergodic (actually a single information pattern 
is dynamically selected). 

5.6. Markets with asymmetric information 

A crucial assumption in all models we have been dealing with so far is that all agents 
possess the same information, be it the real price time series or the bar attendance 
sequence or a random integer. As long as all agents process the same information 
pattern the system can reach some level of coordination and a more or less complicated 
phase structure arises. Unfortunately, it is hard to believe that all agents in real systems 
possess the same information. This brings us to the question: how are the coordination 
properties affected when the information is private, i.e. agent-dependent? 

This question is indeed of fundamental theoretical importance. A substantial part 
of economic theory is based on the assumption that markets are informationally efficient. 
Roughly speaking, a market is efficient with respect to an information set if the public 
revelation of that information would not change the prices of the assets. In other words, 
this means that all the relevant information is incorporated into prices. This includes 
both public and private information. However, it has been understood [Akerlof] that 
asymmetric information may cause inefficiency of the equilibrium, given the strategic 
incentive of each agent not to reveal the information he has. The salvation comes from 
the system size: in fact this nefarious effect may vanish in large markets, since the 



CONTENTS 



81 



single bits of information possessed by an individual agent become less significant the 
larger is the number of agents. Hence, the common understanding is that prices reflect 
information more accurately in large systems. 

To conclude our review, we shall now discuss a model in which the above scenario 
emerges as a phase transition between an informationally efficient phase and an 
informationally inefficient one [72]. The control parameter is, as in the MG, the ratio 
between the size of the information space and the number of traders. 

We consider a market with one asset. The market can find itself in any of P states 
of the world, labeled ji. The return of the asset depends on the state of the world only, 
and is denoted by W^. We assume that each is given by 
j-t^ 

R^' = R+^ (218) 



where the arc independent samples of a Gaussian random variable with zero mean 
and variance s drawn at time t = and fixed (quenched disorder). We further assume 
that at each time step the state of the market is drawn randomly and independently from 
{1, . . . ,P} with equal probability. This process determines the time series of returns 
{i?'*(*)}t>o completely. 

N traders act in this market. They have no information concerning the state of 
the world but rather they observe a coarse-grained signal on the information space 
{1, . . . , P}. We denote it as a vector 

kr.{i,...,P}3i^^Ke{-i,l} (219) 

in which every state of the market is associated to a particular value of a binary variable 
(in other words, an agent cannot tell which state the market is in but only knows whether 
it is an "up state" or a "down state"). Different agents receive different signals, as each 
component kf of every vector fcj is taken to be drawn randomly and independently from 
{— 1, 1} with equal probability for all i and This defines the private information 
structure. Note that if an agent knew simultaneously the partial information of all 
agents he would be able to know the state /j, with probability one, for N ^ oo. 

At each time step, traders i has to decide an investment. Let Zi{t) denote the 
amount of money he decides to invest (buying or selling) at time t. We assume that the 
price at time t, p{t) is fixed by a market clearing condition, in which the demand of the 
asset is determined by the aggregate money invested and the supply is fixed at N: 

^Y,z,{t)=p{t) (220) 

i 

We further assume that Zi{t) depends on whether his information fcf''*'' about the state 
is "up" or "down": Zi{t) = Xlmel-i i| -^r(^)^fc''(*) • ^^is way the price depends on 
the state since the amount invested by each agent depends on the state: p{t) = 

At the end of each period t, each unit of asset pays a monetary amount R^^^*\ If 
agent i has invested Zi{t) units of money, he will hold Zi{t)/p{t) units of asset, so his 
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payoff will be Zi{t){^^ ~ !)■ It follows that the expected payoff is given by 



(222) 



= ^ E E ^K^rnZT (f - l) = E ^^^^rnZT (f - l) (221) 
M mG{-l,l} ^ me{-l,l} ^ 

Every agent aims at choosing the z™'s so as to maximize their expected payoff. We 
consider inductive agents who repeatedly trade in the market. Each agent i has 
a propensity to invest Ul^{t) for each of the signals m e {—1,1}. His investment 
= Xi{Ul^) at time t is an increasing function of U™-{t) (x^ : R — > Rq ) with Xi{^) ~^ 
if X — > — oo and Xi(x) — > oo if x — > oo (a convenient choice for numerical experiments 
is — Ul^9{Ul^)). After each period agents update U'^{t) according to the marginal 
success of the investment: 

mt + 1) = ur{t) + r^,,,) „^ [i?(t) - p{t) - 

The idea is that if the return is larger than the price, the agent's propensity to invest in 
that signal increases, otherwise it decreases. The rj term provides the distinction between 
naive (or price-taking) agents (77 = 0), who are unaware of their market impact, and 
"sophisticated" traders (?] = 1) who instead are able to disentangle their contribution 
to the price exactly. F > is a parameter (In [72] the dynamics (222) is obtained from a 
more properly justified process involving the marginal utility of a certain investment.). 

As a measure of coordination we employ the distance between prices and returns 
in the steady state: 

H ^\R-pf = J2iR^ -P^f (223) 

Clearly, ii H = prices follow returns and hence incorporate the information about the 
states of the world, so that the market is informationally efficient. 

Numerical results for the stationary i7 as a function of o; = P/N for rj — and 
Tj = 1 (and r small enough) are given in Fig. 29. 

Let us start from naive traders {rj = 0). As the number of agents increases, i.e. 
as a — P/N decreases, agents are collectively more efficient in driving prices close to 
returns. Indeed the distance H decreases as a decreases. The price-return distance 
vanishes at a critical point etc which turns out to mark a second order phase transition 
in the statistical mechanics approach. The value of etc depends on the intensity s of 
fluctuations of returns. The region «< cic is characterized by the condition H — 0, 
which means = i?^ for all p. This means that the market efficiently aggregates 
the information dispersed across agents into the price. It can be shown that the 
efficient phase, where H = 0, shrinks as s increases. This is reasonable because as the 
fluctuations in increase, it becomes harder and harder for the agents to incorporate 
them into prices. This behavior can be understood analytically as usual by constructing 
the continuous-time limit of (222). It turns out that if is a Lyapunov function of 
the dynamics: price takers cooperate to make the market as informationally efficient 
as possible. Prom the agent's point of view the steady states in the efficient phase 
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Figure 29. Behaviour of H/a versus a for rj = (left; u{0) is the initial bias in 
the score functions) and various i] > (right: rj = 0.05 (circles), rj = 0.25 (squares), 
T] = 0.5 (diamonds) and r] = 0.75 (triangles). From [73]. 



{a < etc) are not unique and the state in which agents will end up depends on the 
initial conditions {U^(t = 0)} (prices, of course, do not depend on the initial condition, 
because — for all fi). It can also be shown that these steady states in which H 
is minimum correspond to competitive equilibria, namely configurations obtained when 
agents choose their investments a priori by solving 



max xSk,,m (^^ - 1 j (224) 

for m G { — 1, 1}, namely by maximizing their expected profits. 

Turning to sophisticated agents {rj = 1), one sees that the phase transition 
disappears: the distance between prices and returns smoothly decreases as a decreases 
and it vanishes only in the limit ck — > 0. Moreover, the steady state is unique in both 
prices and investment for all a > 0: the asymptotic behavior of learning dynamics does 
not depend on initial conditions. It can be shown that the steady state in this case is 
a Nash equilibrium, that is it corresponds to all agents choosing their investments by 
solving 



max X J2 ^K,ra „^ - 1 (225) 



for m e {—1, 1}, where pt^ = p'^ — ^mg{_i ij, ^kfmZi^lN is the contribution of all other 
agents to the price (in other words, each trader disentangles his contribution from the 
price and optimizes the response to all other traders). 

These findings defy the intuition that Nash equilibria behave similarly to 
competitive equilibria when N ^ oo. Another striking proof of the difference between 
the two equilibrium concepts is given by the quantity 



1 / + - \ 2 

-L \ ^ / Z- — Zi 



(226) 




a 

Figure 30. Behaviour of q versus a for 77 = (left; m(0) is the initial bias in the score 
functions) and various rj > Q (right: 77 = 0.05 (circles), 77 = 0.25 (squares), 77 = 0.5 
(diamonds) and 77 = 0.75 (triangles). Prom [73]. 



which measures how differently agents invest under the two signals, i.e. how much 
they use the information they possess (Fig. 30). Price takers exploit their signals 
much more than sophisticated traders, who invest very similar amounts of money in the 
two states they distinguish. Note that for 77 = the steady state depends on initial 
conditions below ac- The efficient/inefficient transition may then be characterized also 
dynamically trhough transition via path-integral methods [73]. 

There are several other aspects of the model that deserve attention, starting with 
the dependence of fluctuations on F. We refer the interested reader to [72,73] for a more 
detailed discussion. 

6. Conclusions 

Compared to reality, the models discussed in this review have a marked theoretical 
nature. The aim of these models is not that of providing quantitative predictions 
but rather to understand under what conditions the rich variety of behaviors, ranging 
from anomalous fluctuations to spontaneous coordination, may emerge in a simplifled 
controllable setting. This is a complementary approach to that of empirical analysis, 
which has been dominating the scene of interdisciplinary ventures of statistical physicists 
into economics and finance. Indeed, a proper understanding of how interaction 
propagates from the micro to the macro scale, is crucial in many cases in order to 
infer what empirical analysis should focus on. 

Here we have reviewed a number of models with N heterogeneous interacting 
agents - be they firms, species, drivers or traders - who compete for the exploitations 
of a number P of resources. The collective behavior of all these systems belongs to 
the same generic phenomenology, as discussed in Secion 2.1. A key parameter is the 
ratio {a — P/N) between the number of resources and the number of agents, and the 
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central quantities of interest are the (in)efficiency o"^, which is related to the amount of 
unexploited resources, and the unevenness H with which resources are exploited. 

The collective behavior depends strongly on whether agents account or not for their 
impact on the resources. This is somewhat surprising, as one would expect that in the 
limit iV — > oo, the contribution of each agent to the exploitation of each resource is 
vanishing. For the ease of exposition, we distinguish between the two extreme case 
of competitive equilibria (CE) and Nash equihbria (NE), where agents fully neglect 
or account exactly for their impact, respectively. The stationary state of the learning 
dynamics which converges to these equilibria, in Minority Game type models markedly 
differ in the following respects: 

Equilibrium condition In CE resources are exploited, on average, as evenly as 
possible, i.e. H is minimal. In NE fluctuations or wastes are as small as possible 
(i.e. (7^ is minimal). 

Phase transition A phase transition occurs in CE when the number of agents exceeds 
a critical one, i.e. when a < a^- This separates an asymmetric [H > for a > a^) 
from a symmetric (if = for o; < a^) phase. No phase transition takes place in 
NE (i.e. > for all a > 0) 

Degeneracy The stationary state is unique in CE for a > ac and it is degenerate on 
a continuous set for a < a^. There is an exponential number of disjoint NE. 

Initial conditions The stationary state docs not depend on initial conditions for CE 
and a > etc and it depends continuously on initial conditions for a < etc- The NE 
to which the system converges depends discontinuously on initial conditions. 

Fluctuations Agents' behavior is stochastic in CE (i.e. cr^ > H) whereas it is 
deterministic (cr^ = H) in NE. Put differently, in NE agents always play a single 
strategy, whereas in CE agents switch between different strategies. 

Number of choices Giving more strategies to agents improves coordination in NE but 
it can make agents worse off in CE (typically when a is small). 

Convergence Agents converge fast to CE whereas agents may fail to learn to 
coordinate on NE [74] 

Not all these conclusions apply to the asset market model with private information of 
Sec. 5.6, though even there CE and NE differ substantially [72]. 

There still remain interesting theoretical challenges in this field. Some of these are: 

• The MG is a prototype model of a systems where the collective fluctuations which 
agents produce feed back into their dynamics. Still, there are no analytical tools 
which allows us to characterize this feedback in precise terms in the symmetric 
phase of the MG, i.e. to compute the volatility as a function of F. 

• MG based models of financial markets show that anomalous fluctuations similar to 
the stylized facts observed in real markets arise close to the phase transition line. 
Still the critical properties at this phase transition have not yet been characterized. 
Detailed numerical studies of critical properties or analytic approaches based on 
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renormalization group techniques would be very important to shed hght on this 
issue. 

• The MG suggests that real markets operate close to a phase transition but it does 
not explicitly describe a mechanism of how markets would "self-organize" to such 
a state. Though some arguments have been put forward [57], these have not yet 
been formalized in a definite model. 

• The extensions to cases where firms behave strategically, as in Cournot games [2], 
of the model of economic equilibria may prove interesting. The conjecture is that, 
even in the limit A'^ ^ oo if the number of commodities (or markets) also diverges, 
the NE may be markedly different from a CE. 

As a concluding remark, we observe that socio-economic phenomena have features 
which are markedly different from those addressed in natural sciences. Above all, the 
economy and society change at a rate which is probably much faster than that at which 
we understand them. For example, many of the things which are traded nowadays in 
financial markets did not exist few decades ago, not to speak of internet communities. In 
addition, we face a situation in which the density and range of interactions are steadily 
increasing, thus making theoretical concepts based on effective non-interacting theories 
inadequate. 

Definitely, socio-economic systems provide several interesting theoretical challenges. 
Our hope is that these effort will help refine our understanding of how individual 
behavior, interaction and randomness may conspire in shaping collective phenomena, 
which, broadly speaking, is the aim of statistical physics. 
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